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Preface 


This book gives a mathematical treatment of the basic ideas and results 
of logic. It is intended to serve as a textbook for an introductory mathema- 
tics course in logic at the junior-senior level. The objectives are to present 
the important concepts and theorems of logic and to explain their signi- 
ficance and their relationship to the reader’s other mathematical work. 

As a text, the book can be used in courses anywhere from a quarter to a 
year in length. In one quarter, I generally reach the material on models of 
first-order theories (Section 2.6). The extra time afforded by a semester 
would permit some glimpse of undecidability, as in Section 3.0. In a second 
term, the material of Chapter 3 (on undecidability) can be more adequately 
covered, 

The book is intended for the reader who has not studied logic pre- 
viously, but who has some experience in mathematical reasoning. There 
are no specific prerequisites aside from a willingness to function at a 
certain level of abstraction and rigor. There is the inevitable use of basic 
set theory. In Chapter 0 is a concise summary of the set theory used. One 
should not begin the book by studying this chapter; it is instead intended for 
reference if and when the need arises. The instructor can adjust the amount 
of set theory employed; for example it is possible to avoid cardinal numbers 
completely (but some good theorems are then lost). The book contains some 
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examples drawn from abstract algebra. But they are just examples, and are 
not essential to the exposition. The later chapters (Chapter 3 and 4) tend to 
be more demanding of the reader than are the earlier chapters. 
Induction and recursion are given a more extensive discussion (in Section 
1,2) than has been customary. I prefer to give an intuitive account of these 
subjects in lectures and have a precise version in the book rather than to 
have the situation reversed. 
Exercises (over 150 in all) are given at the end of nearly all the sections. 
If the exercise bears a boldface numeral, then the results of that exercise are __ 
used in the exposition in the text. Exercises of unusual difficulty are marked o 
with an ya List of Symbols 
I cheerfully acknowledge my debt to my teachers, a category in which I 
include also those who have been my colleagues or students. I would be pleas- 
ed to receive comments and corrections from the users of this book. 


The numbers indicate the pages on which the symbol first occurs. 
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Introduction 


Symbolic logic is a mathematical model of deductive thought. Or at 
least that was true originally; as with other branches of mathematics it 
has grown beyond the circumstances of its birth. Symbolic logic is a model 
in much the same way that modern probability theory is a model for situa- 
tions involving chance and uncertainty. 

How are models constructed? You begin with a real-life object, for 
example an airplane. Then you select some features of this original object 
to be represented in the model, for example its shape, and others to be 
ignored, for example its size. And then you build an object which is like 
the original in some ways (which you call essential) and unlike it in others 
. (which you call irrelevant). Whether or not the resulting model meets its 
intended purpose will depend largely on the selection of the properties of 
the original object to be represented in the model. 

Logic is more abstract than airplanes. The real-life objects aré certain 
“logically correct” deductions. For example, 


All men are mortal. 
Socrates is a man. 
Therefore, Socrates is ‘mortal. 


The validity of inferring the third sentence (the conclusion) from the first 
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two (the assumptions) does not depend on special idiosyncracies of Socrates. 
The inference is justified by the form of the sentences rather than by empir- 
ical facts about mortality. It is not really important here what “mortal” 


means; it does matter what “all” means. 


Borogoves are mimsy whenever it is brillig. 
It is now brillig, and this thing is a borogove. 
Hence this thing is mimsy. 


Again we can recognize that the third sentence follows from the first two, 
even without the slightest idea of what a mimsy borogove might look like. 

Logically correct deductions are of more interest than the above frivolous 
examples might suggest. In fact, axiomatic mathematics consists of many 
such deductions laid end to end. Thus the deductions made by the work- 
ing mathematician constitute real-life originals whose features are to be 
mirrored in our model. 

The logical correctness of these deductions is due to their form but is 
independent of their content. This criterion is vagne, but it is just this sort 
of vagueness which prompts us to turn to mathematical models, A major 
goal will be to give, within a model, a precise version of this criterion, 
The questions (about our model) we will initially be most concerned with 
are: 


1. What does it mean for one sentence to “follow logically” from certain 
others? 

2. If a sentence does follow logically from certain others, what methods 
of proof might be necessary to establish this fact? 


Actually we will present two models. The first (sentential logic) will 
be very simple and will be woefully inadequate for interesting deductions. 
Its inadequacy stems from the fact that it preserves only some crude prop- 
erties of real-life deductions. The second model (first-order logic) is ad- 
mirably suited to deductions encountered in mathematics. When a working 
mathematician asserts that a particular sentence follows from the axioms 
of set theory, he means that this deduction can be translated to one in 
our model. 

While our model is well suited to mathematics, it must be admitted that 
other models of deductive thought have been proposed for other purposes. 
For example, philosophers and mathematicians have considered so-called 
modal logic and intuitionistic logic, which represent different selections 
of properties of real-life deductions. In addition, the models have been 
generalized and studied for their own interest and for purposes unrelated 


Introduction 3 


to their original genesis. For example, one interesting topic of current 
study is logic with infinitely long sentences. l 

Thus far we have avoided giving away much information about what 
our model, first-order logic, is like. As brief hints, we now give some exam- 
ples of the expressiveness of its formal language. First, take the English 
sentence which asserts the principle of extensionality, “If the same things 


are members of a first object as are members of a second object, then those. . 


objects are the same.” This can be translated into our first-order language 


as ; 
Vx Vy(W2(z E x 4> z E y) => x = y) 


As a second example, we can translate “For every positive number € 
there is a positive number ô such that for any x whose distance from a 
is less than 6, the distance between f(x) and b is less than ” as 


Vele > 0 — 4d(6 >0 AWx(dxa < 6 — dfxb < e))). 


We have given some hints as to what we intend to do in this book. We 
should also correct some possible misimpressions by saying what we are 
not going to do. This book does not propose to teach the reader how to 
think. The word “logic” is sometimes used to refer to remedial thinking, 
but not by us. 


CHAPTER ZERO 


Useful Facts about Sets 


We assume that the reader already has some familiarity with normal 
everyday set-theoretic apparatus. Nonetheless, we give here a brief sum- 
mary of facts from set theory we will need; this will at least serve to establish 
the notation, It is suggested that the reader, instead of poring over this 
chapter at the outset, simply refer to it if and when issues of a set-theoretic 
nature arise in later chapters. 

A set is a collection of things, called its members or elements. As usual, 
we write “t e A” to say that £ is a member of A, and “t ¢ A” to say that 
tis not a member of A. We write “x = y” to mean that x and y are the 
same object. That is, the expression “x” on the left of the equals sign is a 
name for the same object as is named by the other expression “y If A = B, 
then for any object ¢ it is automatically true that te A iff 16 B. (The 
word “iff” means “if and only if.) This is simply because A and B are 
the same thing. The converse is the principle of extensionality: If A and B 
are sets such that for every object ¢, 


ted iff te B, ! 
then A = B. 
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A useful operation is that of adjoining one extra object to a set. For a 
set A, let A ; t be the set whose members are (i) the members of A, plus 
(ii) the (possibly new) member t. Here ¢ may or may not already belong 
to A, and we have 


tedA. iff Ast=A. 


One special set is the empty set @, which has no members at all. For 
any object x there is the singleton set {x} whose only member is x. Moré 
generally, for any finite number x,,...,.~x, of objects there is the set 
{x,,...,X,} whose members are exactly those objects. Observe that 
{x,y} = {y, x}, as both sets have exactly the same members. We have 
only used different expressions to denote the set. 

This notation will be stretched to cover some simple infinite cases. For 


example, {0,1,2,...} is the set N of natural numbers, and {..., —2, 
—1,0, 1,2, ...} is the set Z of all integers. 
We write “{x : ——x—}” for the set of all objects x such that _x_. 


We will take considerable liberty with this notation. For example, {<m, n> 
¿m <n in N} is the set of all ordered pairs of natural numbers for which 
the first component is smaller than the second. And {x€ A pean em as 
the set of all elements in A such that _x_. 

If A is a'set all of whose members are also members of B, then A is a 
subset of B, abbreviated “A S B.” Note that any set is a subset of itself. 
Also, Ø is a subset of every set. (“Z € A” is “vacuously true,” since the task 
of verifying, for every member of Ø, that it also belongs to A, requires 
doing nothing at all. Or from another point of view, “A S B” can be false 
only if some member of A fails to belong to B. If A = ©, this is impossible.) 
From the set 4 we can form a new set, the power set PA of A, whose 
members are the subsets of A. Thus 


PA={x:xS A}. 
For example, 
POS = {D} 
FLD} = {D, {Dy}. 
The union of A and B, A U B, is the set of all things which are members 
of A or B (or both), For example, A ; t = A U {t}. Similarly, the inter- 
section of A and B, A N B, is the set ofall things which are members of 


both A and B. A and Bare disjoint iff their intersection is empty. A collection. 
of sets is pairwise disjoint iff any two members of the collection are disjoint. 
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More generally, consider a set A whose members are themselves sets. 
The union of A, UA, is the set obtained by dumping all the members of 
A into a single set: 


UA = {x : x belongs to some member of 4}. 
Similarly, 
| QA = {x : x belongs to all members of A}. 
For example, if 


A = {{0, 1, 5}, {1, 6}, {1, 5}}, 
then 
= {0, 1, 5, 6}, 
NA = {1}. 
Two other examples are 
A U B = U{A, B}, 
UPA =A. 


‘In cases where we have a set A, for each natural number n, the union of 


all these sets, U{A, : n € N}, is usually denoted “U,ewA,” or just “UnAn.” 
The ordered pair <x, yò of objects x and y must be defined in such a 
way that 


<x, Y> = <u, v 


iffx=u and y=». 

Any definition which has this property will do; the standard one is 
<x, YD = {lx}, {ey}; 

For ordered Pots we define 
CX Vs 2) = KX, V), DD 

More generally we define n-tuples inductively by 

Xn?» Xn? | 


xy, tee ag as KETT aa 


for n > 1. It is convenient to define also <x) = x; the above equation then 


holds also for n = 1. S is a finite sequence of members of A iff for some 


positive integer n, we have S = <x,, ... , x,>, where each x; € A. (Finite 
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sequences are often defined to be certain finite functions, but the above 
definition is slightly more convenient for us.) 


A segment of the finite sequence S = <¢x,,..., Xp? is a finite sequence 


Cika Kriis ++ Amis Am where I<k<m<n. 

This segment is an initial segment iff k = | and it is proper iff it is different 
from S. $ , 
If Xis ee XO = Mi, -e Yad, then it is easy to see that x; = y; for 
| < į <n. (The proof uses induction on n and the basic property of ordered 
pairs.) But if <x, ...,%p> = io -s Yad then it does not in general 
follow that m = n. After all, every ordered triple is also an ordered pair. 
But we claim that m and n can be unequal only if some x; is itself a finite 

sequence of y,’s, or the other way around: 


Lemma OA Assume that <x, 2.6.5 Xm = Vio eeo Yms oo Vmak?” 
Then x, = Pis -s Vka 


Proof We use induction on m. If m = 1, the conclusion is immediate. 
For the inductive step, assume that (x1, ... 4 Xms Xm? = Pre > o Pmghs 
Yasi+e> Then the first components of this ordered pair must be equal: 
(Xp, oy Xm = Vis +++ Yme? Now apply the inductive hypothesis. a 


For example, suppose that A is a set such that no member of A is a finite 
sequence of other members. Then if (x1, ..., Xm? = (iy +++ Vad and each 
x; and y; is in A, then by the above lemma m = n. Whereupon we have 
x; = y; as well, l 

From sets A and B we can form the set A x B of all pairs <x, yò for 
which x € A and y € B. A” is the set of all n-tuples of members of A. For 
example, A3 = (4 x A) x A. 

A relation R is a set of ordered pairs. The domain of R, dom R, is the set 
of ali objects x such that <x, y) E€ R for some y. The range of R, ran R, 
is the set of all objects y such that <x, yò € R'for some x. The union of 
dom Reand ran R is the field of R, fid R. 

An nary reldtion on A is a subset of A”. If n > 1, it is a relation. But 
a l-ary (unary) relation on A iş just a subset of A. A particularly simple 
binary relation on A is the equality relation {<x, x> : x € A} on A. For 
an n-ary relation R on A and subset B of A, the restriction of R to B is the 
intersection R A B”, 

A function is a relation F with the property that for each x in dom F 
there is only one y such that <x, yò € F. As usual, this unique y is said to be 
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the value F(x) which F assumes at x. We say that F maps A into B and write 
F:4>B 


to mean that F is a function, dom F =.A, and ran F S B. If in addition 
ran F =.B, then F maps A onto B. F is one-to-one iff for each y in ran F 
there is only one x such that <x, yò e F. If the pair <x, y> is in dom F, 
then we let F(x, y) = F(x, y>). This notation is extended to n-tuples; 
F(x, wee » Xp) = Fx, tee jd): 

An n-ary operation on A is a function mapping A” into A, For example, 
addition is a binary operation on N, whereas the successor operation S 
(where S(7) = n + 1) is a unary operation on N. If fis an n-ary operation 
on A, then the restriction of f to a subset B of A is the function g with domain 
B” which agrees with f at each point of B”. Thus 


g=fo (B" x A), 


This g will be an n-ary operation on B iff B is closed under f, in the sense 
that /(b,, ..., bn) E B whenever each b; is in B. In this case, g = f AO Be, 
in agreement with our definition of the restriction of a relation. 

A particularly simple unary operation on A is the identity function i 
on A, given by the equation 


i(x) =x for xe A. 


For a relation R, we define the following: 


R is reflexive on A iff <x, x» € R for every x in A. 

R is symmetric iff whenever <x, yò € R, then also <y, x> E€ R. 

R is transitive iff whenever both <x, yò € R and <y, Z> € R, then also 
«x, 2>E R. 

R satisfies trichotomy on A iff for every x and y in A, exactly one of the 
three possibilities, <x, yò € R, x = y, or <y, x> E€ R, holds. 

R is an equivalence relation on A iff R is a binary relation on A which 
is reflexive on 4, symmetric, and transitive. 

R is an ordering relation on A iff R is transitive and satisfies trichotomy 
on A. 

For an equivalence relation R on A we define, for x € A, the equivalence 
class |x] of x to be {y : <x, y> € R}. The equivalence classes then partition 
A. That is, the equivalence classes are subsets of A such that each member 
of A belongs to exactly one equivalence class. For x and y in A, 


x] = >] iff x, wer. 


Useful Facts about Sets 9 


At a few points in the book we will use the axiom of choice. But usually 
these uses can be eliminated if the theorems in question are restricted to 
countable languages. Of the many equivalent statements of the axiom of 
choice, Zorn’s lemma is especially useful. 

Say that a collection C of sets is a chain iff for any elements x and y of 
C, either x S yoryex, 


Zorn’s Lemma Let A be a nonempty set such that for any chain C S A, 
the set UC is in A. Then there is some element m € A which is maximal 
in the sense that it is not a subset of any other element of A. 


The set N of natural numbers is the set {0, 1, 2, ...}. (It can also be 
defined set-theoretically, but we will not do so here.) A set A is finite iff 
there is some one-to-one function f mapping (for some natural number n) 
the set A onto {0,1,...,2— 1}. (We can think of f as “counting” the 
members of A.) 

A set A is countable iff there is some function mapping A one-to-one into 
N. For example, any finite set is obviously countable. Now consider an in- — 
finite countable set A. Then from the given function f mapping a one-to-one 
into N, we can extract a function f’ mapping A one-to-one onto N. For 
some ay E A, f(a) is the least member of ran f; let f'(a) = 0. In general 
there is a unique a, € A such that f(a,) is the (n + 1)st member of ran f; 
let f'(a) = n. Note that A = {a» a), ...}. (We can also think of f' as 
“counting” the members of A, only now the counting process is infinite.) 


Theorem 0B Let A be a countable set. Then the set of all finite sequences 
of members of A is also countable. 


Proof The set S of all such finite sequences can be characterized by 
the equation 
S= (ja. 


neN 


Since A is countable, we have a function f mapping A one-to-one into N. 

The basic idea is to map S one-to-one into N by assigning to <a), a4, ..., 
am? the number 2/lao+13sa41 . || pflam+1, where pp is the (m + 1)st 
prime. This suffers from the defect that this assignment might not be well- 
defined. For conceivably there could be (ay, dy, ... 5 am = Cbo, bis oo s DaDo 
with a; and b; in A but with m Æ n. But this is not serious; just assign to 
each member of S the smallest number obtainable in the above fashion. . 
This gives us a well-defined map; it is easy to see that it is one-to-one. 
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At times we will speak of trees, which can be useful in providing intuitive 
pictures of some situations. But our comments on trees will always be in- 
formal, as we exclude this concept from the theorems and their proofs. 
Accordingly, our discussion here of trees will be informal. i 

For each tree there is an underlying finite partial ordering. We can draw 
a picture of this partial ordering R; if <a, b> € R, then we put a lower than 
b and connect the points by a line. Two typical pictures of tree orderings 


os, we 
i ` : 
aE Ra Y 


(Perhaps the word “tree” should be replaced by “root” since our branching 
is downward, not upward.) There is always a highest point in the picture. 
Furthermore, while branching is permitted below some vertex, the points 
above any given vertex must lie along a line. 

In addition to this underlying finite partial ordering, a tree also has a 
labeling function whose domain is the set of vertices. For example, one 
tree, in which the labels are natural numbers, is 


Cardinal numbers 


All infinite sets are big, but some are bigger than others. (For example, 
the set of real numbers ts bigger than the set of integers.) Cardinal numbers 
provide a convenient, although not indispensable, way of talking about 
the size of sets. 

It is natural to say that two sets A and B have the same size iff there is a 
function which maps A one-to-one onto B. If A and B are finite, then this 
concept is equivalent-to the usual one: If you count the members of A 
and the members of B, then you get the same number both times. But it 
is applicable even to infinite sets A and B, where counting is difficult. 
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Formally, then, say that A and B are equinumerous (A ~~ B) iff there is a 
one-to-one function mapping A onto B. For example, the set N of natural 
numbers and the set Z of integers are equinumerous. It is easy to see that 
equinumerosity is reflexive, symmetric, and transitive, 

For finite sets we can use natural numbers as measures of size. The 
same natural number would be assigned to two finite sets (as measures 
of their size) iff the sets were equinumerous. Cardinal numbers are intro- 
duced to enable us to generalize this situation to infinite sets, i 

To each set 4 we can assign a certain object, the cardinal number (or car- 
dinality) of A (card A), in such a way that two sets are assigned the same 
cardinality iff they are equinumerous: 


(K) card A = card B iff A~ B. 
There are several ways of accomplishing this; the standard one these days 
takes card A to be the least ordinal equinumerous with A. (The success of 
this definition relies on the axiom of choice.) We will not discuss ordinals 
here, since for our purposes it matters very little what card 4 actually is, 
any more than it matters what the number 2 actually is. What matters 
most is that (K) holds. It is helpful, however, if for a finite set A, card A 
is the natural number telling how many elements A has. Something is a 
cardinal number, or simply a cardinal, iff it is card A for some set A . 

(Georg Cantor, who first introduced the concept of cardinal number, 
characterized in 1895 the cardinal number of a set M as “the gerieral concept 
which, with the help of our active intelligence, comes from the set M upon 
abstraction from the nature of its various elements and from the order of 
their being given.”) 

Say that A is dominated by B (A < B) iff A is equinumerous with a subset 
of B. In other words, A < B iff there is a one-to-one function mapping A 
‘nto B. The companion concept for cardinals is 


card A < card B iff A <B. 


(It is easy to see that < is well defined; that is, whether or not x < A de- 
pends only on the cardinals x and A themselves, and not the choice of sets 
having these cardinalities.) Dominance is reflexive,and transitive. A set A 
Is dominated by N iff A is countable. The following is a standard result 
in this subject, 


Schréder-Bernstein Theorem (a) For any sets A and B, if A < B 
and B < A, then A~ B. E 


(b) For any cardinal numbers x and A, if x < A and 4< x, then x = À. 
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Part (b) is a simple restatement of part (a) in terms of cardinal numbers, 


The following theorem, which happens to be equivalent to the axiom of — 


choice, is stated in the same dual manner. 


Theorem 0C (a) For any sets A and B, either A < Bor BX A. 
(b) For any cardinal numbers x and A, either x <A or Asm, 


Thus of any two cardinals, one is smaller than the other. (In fact, any 
nonempty set of cardinal numbers contains a smallest member.) The small- 
est cardinals are those of finite sets:0,1,2,.... There is next the small- 
est infinite cardinal, card N, which is given the name $. Thus we have 


Dl Zass Mr Ot 


where $Ñ, is the smallest cardinal larger than No. The cardinality of the 
real numbers, card R, is called “280,” Since R is uncountable, we have 
No < 250, . 

The operations of addition and multiplication, long familiar for finite 
cardinals, can be extended to all cardinals. To compute % -+ A we choose 
disjoint sets A and B of cardinality x and A, respectively. Then 


z + A= card (A U B). 


This is well defined; i.e., % + A depends only on x and A, and not on the 
choice of the disjoint sets A and B. For multiplication we use 


2x +A=card(A X B). 


Clearly these definitions are correct for finite cardinals. The arithmetic of 
infinite cardinals is surprisingly simple. The sum or product of two infinite, 
cardinals is just the larger of them: 


Cardinal Arithmetic Theorem For cardinal numbers x and A, if «<A 


and A is infinite, then x + 4 =A. Furthermore, if «540, then «- A= A, 


In particular, for infinite cardinals x, 
Noc x= x. 


Theorem 0D Foran infinite set A, the set U,,A”*! of all finite sequences 
of elements of A has cardinality equal to card A. 


We already proved this for the case of a countable A (see Theorem OB). 
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Proof Each A”*! has cardinality equal to card A, by the cardinal arith- 
metic theorem. So we have the union of $, sets of this size, yielding $, - card 
A = card A points altogether. | 


EXAMPLE It follows that the set of algebraic numbers has cardinality $4. 
First, we can identify each polynomial (in one variable) over the integers 
with the sequence of its coefficients. Then by the theorem there are Ne 
polynomials, Each polynomial has a finite number of roots. To give an 
extravagant upper bound, note that even if each polynomial had N, roots, 
we would then have $,- No = N, algebraic numbers altogether. Since 
there are at least this many, we are done. 

Since there are uncountably many (in fact, 28°) real numbers, it follows 
that there are uncountably many (in fact, 2°) transcendental numbers. 


Throughout the book we will utilize an assortment of standard math- 
ematical abbreviations. We have already used “m” to signify the end 
of a proof. A sentence “If..., then...” will sometimes be abbreviated - 
“...=>..,.” We also have “<=” for the converse implication. For “if 
and only if” we use, in addition to the word “iff,’ the symbol “<>,” 
For the word “therefore” we have the abbreviation “°,.” 

The notational device that extracts “x34 y” as the denial of “x = y” 
and “x ¢ y” as the denial of “x e y” will be extended to other cases. 
For example, on page 33 we define “S fr”; then “S x” is its denial. 


CHAPTER ONE 


Sentential Logic 


§ L0 INFORMAL REMARKS ON FORMAL LANGUAGES 


We are about to construct (in the next section) a language into which 
we can translate English sentences. Unlike natural languages (like English 
or Chinese), it will be a formal language, with precise formation rules. 
But before the precision begins, we will discuss here some of the features 
we want to incorporate into the language. 

As a first example, the English sentence “Traces of potassium were 
observed” can be translated into the formal language as, say, the symbol 
K. Then for the closely related sentence “Traces of potassium were not 
observed,” we can use (~K). Here ~ is our negation symbol, read as 
“not.” One might also think of translating “Traces of potassium were not 
observed” by some new symbol, e.g., J, but we will prefer instead to break 
such a sentence down into atomic parts as much as possible. For an unrelated 


sentence, ‘The sample contained chlorine,” we choose, say, the symbol C, °_ 


Then the following compound English sentences can be translated as the 
formulas shown at the right: 


14 
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If traces of potassium were observed, then the sample (K — (-C)) 
did not contain chlorine. 
The sample contained chlorine, and traces of potas- . (Ca K) 
sium were observed. 


In the second example we use our conjunction symbol A to translate 
“and.” The first example uses the more familiar arrow to translate “if ..., 
then... .” In the following example the disjunction symbol v is used 
to translate “or”: 


Either no traces of potassium were observed, or ((=K) v (=C)) 
the sample did not contain chlorine. 
The sample neither contained chlorine nor were (-(C v K)) 


traces of potassium observed, or 
(=C) a (>K)) 


In this last example we have given two alternative: translations. The 
relationship between them will be discussed later. 

One important aspect of the decompositions we will make of compound 
sentences is that whenever we are given the truth or falsity of the atomic 
parts, we can then immediately calculate the truth or falsity of the com- 
pound. Suppose, for example, that the chemist emerges from his laboratory 
and announces that he observed traces of potassium but-that the sample 
contained no chlorine. We then know that the four above sentences are 
true, false, true, and false, respectively. In fact, we can construct in advance 
a table giving the four possible experimental results (Table 1). We will 
return to the discussion of such tables in Section 1.3. 


TABLE I 
K Cc (Cv K)) (AC) A (9K) 
F F T T 
F T F F 
T F F F 
T T F F 


Use of formal languages will allow us to escape from the imprecision 
and ambiguities of natural languages. But this is not done without cost; 
our formal languages will have a sharply limited degree of expressiveness. 
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In order to describe a formal language we will generally give three pieces 
of information: 


1. We will specify the set of symbols (the alphabet). In the present case 
of sentential logic some of the symbols are 


C, ), =>, —, Å, Àz, Orr 


2. We will specify the rules for forming the “grammatically correct” 


finite sequences of symbols. (Such sequences will be called well-formed 


formulas or wffs.) For example, in the present case 


(A, > (“A,)) 
will be a wff, whereas 
))— A; 
will not. 


3. We will also indicate the allowable translations between English and 
the formal language. The symbols A,, A,,... can be translations of de- 
clarative English sentences. 


Only in this third part is any meaning assigned to the wffs. This process 
of assigning meaning guides and motivates all we do. But it will also be 
observed that it would theoretically be possible to carry out various ma- 
nipulations with wffs in complete ignorance of any possible meaning. A 
person aware of only the first two pieces of information listed above could 
perform some of the things we will do, but it would make no sense to him. 

Before proceeding, let us look briefly at another class of formal languages 
of widespread interest today. These are the languages used by (or at least 
in connection with) digital computers, 

There are many of these languages. In one of them a typical wff is 


0110101 10101000111110001000001 111010 
In another a typical wff is 
STEP#ADDIMAX,A. 


(Here # is a symbol called a blank; it is brought into the alphabet so that 
a wif will be a string of symbols.) A well-known language called Fortran 
has wffs such as 


DO#3#41=IMIN,IMAX, 
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In all cases there is a given way of translating the wffs into English, and 
(for a restricted class of English sentences) a way of translating from En- 
glish into the formal language. But the computer is unaware of the English 
language. An unthinking automaton, the computer juggles symbols: and 
follows its program slavishly. We could approach formal languages that 
way too, but it would not be much fun. 


§ 1.1 THE LANGUAGE OF SENTENTIAL LOGIC 


We assume we are given an infinite sequence of distinct objects_which ~ 
we will call symbols, and to which we now give names (Table ID. We 
further assume that no one Of these symbols is a finite sequence of other 
symbols. 

TABLE II 


Symbol Verbose name Remarks 


a aaa aaam aaaaaeaauħe 


( left parenthesis punctuation 

) right parenthesis punctuation 

~ negation symbol English: not 

A: conjunction symbol English: and 

Vv disjunction symbol English: or (inclusive) 
- ; conditional symbol English: if—, then— 
o> biconditional symbol English: if and only if 
Ay first sentence symbol 

Ag second sentence symbol 

An nth sentence symbol 


oe 


Several remarks are now in order: 
!. The five symbols 


“ly, A, V, AE > 


are called sentential connective symbols, Their use is suggested by the En- 
glish translation given above. The sentential connective symbols, together 
with the parentheses, are the logical symbols. In translating to and from 
English, they always play the-same role.The sentence symbols are the 


parameters (or nonlogical symbols). Their translation is not fixed; instead 


they will be open to a variety of interpretations, as we shall illustrate shortly. 
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2. We have included only countably many. sentence symbols. Most of 
the things we say in this chapter would apply equally well if we had allowed 
an arbitrary set of sentence symbols. (The exceptions are primarily in Sec- 
tion 1.7.) 

3, Some logicians prefer to call A, the nth proposition symbol (and to 
speak of propositional logic instead of sentential logic). This stems from 
wanting the word “sentence” to refer to a particular utterance, and want- ` 
ing a proposition to be that which a sentence asserts. 

4. We call these objects “symbols,” but we remain neutral as to what 
the exact ontological status of symbols might be. In the leftmost column 
of our list of symbols, names of the symbols are printed. For example, 
Az is one symbol, namely the two hundred forty-third sentence symbol. 
(On the other hand, “Az” is a name of that symbol. The conditional 
symbol may or may not have the geometric property of being shaped like 
an arrow, although its name “=>” does.) The symbols themselves can be 
sets, numbers, marbles, or objects from a universe of linguistic objects. 
In the last case, it is conceivable that they are actually the same things as 
the names we use for them. Another possibility, which will be exploited 
in the next chapter, is that the sentence symbols are themselves formulas 
in another language. 

5. We have assumed that no symbol is a finite sequence of other symbols. 
We mean by this that not only are the symbols listed distinct (¢.g., Ag Æ +>), 
but no one of them is a finite sequence of two or more symbols. For exam- 
ple, we demand that As Æ in, Ay, ©. (It is a fact of set theory that Ag 
Æ lm, Ag, (œ) The purpose of this assumption is to assure that finite 
sequences of symbols be uniquely decomposable. If 


<a, see > Am) = <by, reto’ b,> 


and each a; and each b; is a symbol, then m = n and a; = b;. See Chapter 
0, Lemma 0A, and subsequent remarks.) 


An expression is a finite sequence of symbols. We can specify an expres- 
sion by concatenating the names of the symbols; thus (~A) is the se- 
quence <(, ~, Aj,)>. This notation is extended: If œ and f are sequences 
of symbols, then æf is the sequence consisting first of the symbols of the 
sequence a followed by the symbols of the sequence £. 

For example, if @ and f are the expressions given by the equations 


a= (A,), 
B = Ag, 
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then (æ —> 8) is the expression 
((7A,) > A,). 


We should now look at a few examples of possibie translations of En- 
glish sentences into expressions of the formal language. Let A, B Z 
be the first twenty-six sentence symbols. (For example, E = As.) ' 


1. English: The suspect must be released from custody. Translation: R 

English: The evidence obtained is admissible. Translation: E di 

English: The evidence obtained is inadmissible. Translation: (E) 

English: The evidence obtained is admissible, and the suspect need aol 
be released from custody. Translation: (E A (—R)). 

English: Either the evidence obtained is admissible, or the suspect must 
be released from custody (or possibly both). Translation: (E v R). 

English: Either the evidence obtained is admissible, or the suspect must 
be released from custody, but not both. Translation: (CE v R) A (™@E AR))) 
We intend always to use the symbol v to translate the word ‘‘or” in 
its inclusive meaning of “and/or.” 

English: The evidence obtained is inadmissible, but the suspect need 
not be released from custody. Translation: ((7E) A (™R)). On the other 
hand, the expression ((—) v (—R)) translates the English: Either the 


evidence obtained is inadmissible or the suspect need not be released from 
custody. 


2. English: If wishes are horses, then beggars will ride. Translation: 
(W = B). 

English: Beggars will ride if and only if wishes are horses. Translation: 
(B <> W). 
; 3. English: This commodity constitutes wealth if and only if it is trans- 
erable, limited in supply, and either productive of pleasure or preventive 
ss Translation: (W < (Ta (LA (Pv Q)))). Here W is the trans- 
ation of “This commodity constitutes wealth.” Of course in the preceding 


example we used W to translate a different sentence. We are not tied to 
any one translation. 


One note of caution: De not confuse a sentence in the English language 
(Roses are red) with a translation of that sentence in the formal language 
Sy a These are different. The English sentence is presumably either 
o a S But the formal expression is just a sequence of symbols. It 
ai eed be interpreted in one context as a true (or false) English sentence, 

at it can have other interpretations in other contexts. 
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Now some expressions cannot be obtained as translations of any English 
sentences but are mere nonsense, like 


> (CAs. 


We want to define the well-formed formulas (wffs) to be the “grammatically 
correct” expressions; the nonsensical expressions must be excluded. The 
definition will have the following consequences: 


(a) Every sentence symbol is a wff. 

(b) If œ and 8 are wffs, then so are (ma), (œ AP), (œ v p), (œ — P), 
and (œ «> f). 

(c) No expression is a wff unless it is compelled to be one by (a) and (b). 


There are two equivalent ways of making this third property (about 
compulsion) precise. The first way defines the set of wffs “from the top 
down.” Let us say that a set S of expressions is inductive iff it has the 
properties 

(as) Every sentence symbol is in S. 

(bs) If expressions a and 8 are in S, then so also are (0), (œ a p), 


‘(av B), (œ — b), and (a +B). 


An expression is a well-formed formula (wff) iff it is a member of every 
inductive set. Then it is not hard to see that (a) and (b) are satisfied. As 
for (c), we can say that the set of all wffs is as small as it can be, in the sense 
that it is a subset of any other inductive set. 

The second (and equivalent) definition works “from the bottom up.” 
An expression is a wff if and only if it can be built up from the sentence 
symbols by applying some finite number of times the formula-building 
operations (on expressions) defined by the equations 


Ze) = (ma), 
F(a, B) = (æ a B), 
Fa, P) = (æ v P), 
S_(a, P) = (a A), 
Eo, P) = (a <> p). 
For example, ` 


((Ay A Ayo) —> (CAs) V (Ag <> Ag))) 
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is a wff, as can be seen by contemplating its ancestral tree: 


((Ar A Ao) > (mAs) V (Ag +> A5))) 


((7A3) v (Ag + A,)) 
ys Aa 
(Ay A Ajo) (mAs) (Ag 4> As) 


AN 


Ay Axo A; Ag As 


The tree illustrates the construction of the expression from four sentence 
syinbols by five applications of formula-building ‘operations. 

These two definitions of wffs are discussed in a more general setting in 
Section 1.2, Here we only note that any inductive set of wffs is actually 
the set of all wffs; this fact will be called the induction principle. This prin- 
ciple will receive much use in the coming pages. In the following example 
we use it to show that certain expressions are not wffs. 


EXAMPLE Any expression with more left parentheses than right paren- 
theses is not a wff, 


Proof The idea is that we start with sentence symbols (having zero left 
parentheses and zero right parentheses), and then apply formula-building 
Operations which add parentheses only in matched pairs. We can rephrase 
this argument as follows: The set of “balanced” wffs (having equal numbers 
of left and right parentheses) contains all sentence symbols and is closed 
under the formula-building operations. Thus the set of balanced wffs is 


Inductive; the induction principle then assures us that all wffs are bal- 
anced, 


EXERCISES 


1. Give three sentences in English together with translations into our for- 
pc language, The sentences should be chosen so as to have an interesting 
Structure, and the translations should each contain fifteen or more symbols. 


2. Show that there are no wffs of length 2, 3, or 6, but that any other 
€ngth is possible. 


ee Leta bea wif; let c be the number of places at which binary connective 
nals (A, Vv, =>, +») occur in g; let s be the number of places at which 
ence symbols occur in æ. Show that s = c +1. 
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§1.2 INDUCTION AND RECURSION 


Induction 


There is one special type of construction which occurs frequently both 


in logic and in other branches of mathematics. We may want to construct ._ 


a certain subset of a set U by starting with some initial elements of U, and 
applying certain ‘operations to them over and over again. The set we seek 
will be the smallest set containing the initial elements and closed under 
the operations. Its members will be those elements of U which can be 
built up from the initial elements by applying the operations a finite number 
of times. 

In the special case of immediate interest to us, U is the set of expressions, 
the initial elements are the sentence symbols, and the operations are Zas 


Z, etc. The set to be constructed is the set of wffs. But we will encounter - 


other special cases later, and it will be helpful to view the situation abstractly 
here. 

To simplify our discussion, we will consider an initial set B S U and a 
class F of functions containing just two members f and g, where 


f:UxU—>U and g:U>U. 


Thus fis a binary operation on U and g is a unary operation. (Actually 
F need not be finite; it will be seen that our simplified discussion here is, 
in fact, applicable to a more general situation. F can be any set of rela- 
tions on U, and in Chapter 2 this greater generality will be utilized. But the 
case discussed here is easier to visualize and is general enough to illustrate 
the ideas. For a less restricted version, see Exercise 3.) : 

If B contains points a and b, then the set C we wish to construct will 
contain, for example, 


b, f (b, D), g(a), S (g(a), F (b, D), gS la, S DD). 


Of course these might not all be distinct. The idea is that we are given 


certain bricks to work with, and certain types of mortar, and we want Co 


to contain just the things we are able to build. 


In defining C more formally, we have our choice of two definitions. : 
We can define it “from the top down” as follows: Say that a subset S of U- 
is closed under f and g iff whenever elements x and y belong to S, then so — 


do f(x, y) and g(x). Say that S is inductive iff Bc S and S is closed under 


fand g. Let C* be the intersection of all the inductive subsets of U; thus. 
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x e C* iff x belongs to every inductive subset of U. It is not hard to see 
(and the reader should check) that C* is itself inductive. Furthermore, C* 
is the smallest such set, being included in all the other inductive sets. 

The second (and equivalent) definition works “from the bottom up.” 
We want C to contain the things which can be reached from B by applying 
fand ga finite number of times. Temporarily define a construction sequence 
to be a finite sequence (Xp, ..., Xa? of elements of U such that for each 
i <n we have at least one of 


4 


x; E B, 
X= f (xj, Xa) for some j <i, k <i, 
x, = g(x;) for some j <i. 


Then let C, be the set of all points x such that some construction sequence 
ends with x. 

Let C, be the set of points x such that some construction sequence of 
length n ends with x. Then C, = B, 


CrEC, EQS PEI 


and Ca = U,C,. For example, e( f(a, f (b, b))) is in C; and hence in Cy, 
as can be scen by contemplating the tree shown: 


(f(a, f(b, by) 


| 
J(e, f(b, b) 


We obtain a construction sequence for g(f (a, f(b, b))) by squashing this 
tree into a linear ordering. i 


EXAMPLES 1. The natural numbers. Let U be the set of all real numbers, 
and let B = {0}. Take one operation S, where S(x) = x + 1. Then 


Cy = (01,2, ..-}.. 


The set C, of natural numbers contains exactly those numbers obtainable 
from 0 by applying the successor operation repeatedly. 
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2. The integers. Let U be the set of all real numbers; let B = {0}. This 
time take two operations, the successor operation S and the predecessor 
operation P: l 

Sx)=x+1 and P(x) =x—1. 


Now C} contains all the integers, 
Cy = {..., —2, -1, 0, 1, 2, ane 


Notice that there is more than one way of obtaining 2 as a member of Cy.’ 
For 2 is S(S(0)), but it is also S(P(S(S(0)))). 

3. The algebraic functions. Let U contain all functions whose domain 
and range are each sets of real numbers. Let B contain the identity function 
and all constant functions. Let F contain the operations (on functions) 
of addition, multiplication, division, and root extraction. Then C} is the 
class of algebraic functions. 


4. The well-formed formulas. Let U be the set of all expressions and 
let B be the set of sentence symbols. Let # contain the five formula-build- 
ing operations on expressions: &,, So BH, Z., and &,. Then Cy is 
the set of all wffs. 


At this point we should verify that our two definitions are actually 
equivalent, i.e., “that C* = C}. 

To show that C* S C, we need only check. that C, is inductive, i.e., 
that B S C, and C, is closed under the functions. Clearly B = Cı S Cy. 
If x and y are in Cx, then we can concatenate their construction sequences 
and append f(x, y) to obtain a construction sequence placing f(x, y) in 
Cx. Similarly, C,, is closed under g. 

Finally, to show that C,, < C* we consider a point in Cy and a con- 
struction sequence (x9, ...,%,> for it. By ordinary induction on i, we 
can see that x; e C*, i <n. First x» € B © C*. For the inductive step we 
use the fact that C* is closed under the functions. Thus we conclude that 


U Cr = Cy = C* = {S : S is inductive}. 
n 
(A parenthetical remark: Suppose our present study is embedded in 
axiomatic set theory, where the natural numbers are usually defined from 
the top down. Then our definition of C, (employing finiteness and hence 
natural numbers) is not really different from our definition of C*. But 
we are not working within axiomatic set theory; we are working within 
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intuitive mathematics. And the notion of natural number seems to be a 


solid, well-understood intuitive concept.) E 
Since C* = Cx, we call the set simply C and refer to it as the set generated 


from B by the functions in @ We will often want to prove theorems by 
using the following: , 


Induction Principle Assume that C is the set generated from B by the 
functions in # If S is a subset of C which includes B and is closed under. 


the functions of Z; then S = C. 


Proof S is inductive, so C = C* S S. We are given the other in- 


clusion. E 


The special case now of interest to us is, of course, Example 4, Here C 
is the class of wffs generated from the set of sentence symbols by the for- 
mula-building operations. This special case has interesting features of its 
own. Both a and £ are proper segments of Zla, p), i.e., of (œ A p). More 
generally, if we look at the family tree of a wif, we see that each constituent 
is a proper segment of the end product. 


(Ag V (Ag + Ag)) 


Suppose, for example, that we temporarily call an expression special if 
the only sentence symbols in it are among {A,, As, As} and the only 
connective symbols in it are among {—,—>}. Then no special wff requires 
A, or Z, for its construction. In fact, every special wff belongs to the set 
C, generated from {A,, Aj, As} by Za and &,. (For we can use the 
induction principle to show that every wff either belongs to C, or is not 
Special.) 


Recursion! 
We return now to the more abstract case. There is a set U (such as the 


set of all expressions), a subset B of U (such as the set of sentence symbols), 


‘The reader not already familiar with recursion is advised to postpone reading this 
Subsection until after reading Section 1.3, where a specific application is encountered. 
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and two functions f and g, where 
f:ux U—U and g:U>U. 


C is the set generated from B by f and g. 
The problem we now want to consider is that of defining a function on C 
recursively. That is, we suppose we are given 


1. Rules for computing A(x) for x € B. 
2a. Rules for computing AC/(x, y)), making use of f(x) and AO). 
2b. Rules for computing A(g(x)), making use of A(x). 


(For example, this is the situation discussed in Section 1.3, where Å is 
the extension of a truth assignment for B.) It is not hard to see that there 
can be at most one function Å on C meeting all the given requirements. 

But it is possible that no such h exists; the rules may be contradictory. 
For example, let 

U = the set of real numbers, 


B = {0}, 
SHV) =x: y, 
g(x) =x+1. 
Then C is the set of natural numbers. Suppose we impose the following 
requirements on Å: 7 
1. A) =0. 


2a. AC f(x, IY = Sa), AO). 
2b. Ae = h(x) + 2. 


Then no such function Å can exist. (Try computing /A(1), noting that we 
have both | = g(0) and 1 = f(g(0), g(0)). 

A similar situation is encountered in algebra.! Suppose that you have a 
group G which is generated from B by the group multiplication and inverse 
operation. Then an arbitrary map of B into a group H is not necessarily 
extendible to a homomorphism of the entire group G into H. But if G hap- 
pens to be a free group with set B of independent generators, then any 
such map is extendible to a homomorphism of the entire group. 


1 It is hoped that examples such as this will be useful to the reader with some algebraic 
experience, The other readers will be glad to know that these exarnples are merely illus- 
trative and not essential to our development of the subject. í 
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Say that C is freely generated from B by f and g iff in addition to the 
requirements for being generated we have 


1. fo and gg are one-to-one, and 
2. The range of fo, the range of gg, and the set B are pairwise disjoint. 
(Here fy and gg are the restrictions of f and g to C.) 


Recursion Theorem Assume that the subset C of U is freely generated 
from B by f and g, where 
f:U x U>U, 
g:U—U. 


Further assume that V is a set and F, G, and h functions such that 


h: B—> V, 
F:VxV-/Y, 
G:V—YV. 


Then there is a unique function 
RCV 
such that 
(i) For x in B, h(x) = A(x). 
(ii) For x, y in C, 
ACS, y) = FAR), AO), 
h(g(x)) = CH). 
Viewed algebraically, the conclusion of this theorem says that any map 
h of B into V can be extended to a homomorphism Å from C (with oper- 
ations f and g) into V (with operations F and G). , 
If the content of the recursion theorem is not immediately apparent, 


try looking at it chromatically. You want to have a function h which paints 
each member of C some color. You have before you 


1. h, telling you how to color the initial elements in B; 
2. F, which tells you how to combine the color of x and y to obtain 


the color of f(x,y) Ge. it gives A( f(x, y)) in terms of A(x) and A(y)); 


3. G, which similarly tells you how to convert the color of x into the 
color of glx). 
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` The danger is that of a conflict in which, for example, F is saying “green” 


but G is saying “red” for the same point (unlucky enough to be equal both | 
to f(x, y) and g(z) for some x, y, z). But if C is freely generated, then this : 


danger is avoided, 


EXxampLes Consider again the examples of the preceding subsection. 


1. B = {0} with one operation, the successor operation S, Then C is 
the set N of natural numbers. Since the successor operation is one-to-one, | 


C is freely generated from {0} by S. Therefore, by the recursion theorem, 


for any set V, any a € V, and any F: V —> V there is a unique Å : N — V : 


such that A(0) = a and A(S(x)) = F(A(x)) for each x€ N. For example, 
there is a unique Å: N— N such that A(0) = 0 and A(S(x)) = 1 — A(x). 


This function has the value 0 at even numbers and the value 1 at odd | 


numbers. 

2. The integers are generated from {0} vy the successor and predecessor 
operations but not freely generated. 

3. Freeness fails also for the generation of the algebraic functions in 
the manner described. 

4. The wffs are freely generated from the sentence symbols by the five 
formula-building operations. The purpose of Section 1.4 is to prove this 
fact. It follows, for example, that there is a unique function Å defined on 
the set of wffs such that 


h(A) = 1 for a sentence symbol A, 
h((0)) = 3 + h(a), 
h(a a B)) = 3 + Al) + hi), 


and similarly for v, —>, and <+>. This function gives the length of each wff. 


Proof of the recursion theorem The idea is to let A be the union of many 
approximating functions. Temporarily call a function v (which maps part 
of C into V) acceptable if it meets the conditions imposed on fA by (i) and 
(ii). More precisely, v is acceptable iff the domain of v is a subset of C, the 
range a subset of V, and 


(i) If x belongs to B and to the domain of v, then v(x) = A(x). 


(ii) If f(x, y) belongs to the domain of v, then so do x and y, and 
vf (x, y)) = F(x), v(y)). If a(x) belongs to the domain of v, then so does 
x, and v(g(x)) = G(v(x)). 
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Let K be the collection of all acceptable functions, and let Å be the union 

of K. Thus 
<x yeh iff v(x) = y for some ve K. 

We claim that A meets our requirements. We will outline the procedure 
for checking this, leaving many details to the reader. (We feel that a detailed 
understanding of this set-theoretic proof, while nice, is not essential here. 
But some understanding of its outline should make the theorem itself 
more comprehensible.) 


1. We claim that Å is a function. Let 
S= {xe C: For at most one y, <x, yò € Å}. 


It is easy to verify that S is inductive, by using (i') and (ii^). Hence S = C 
and Å is a function. l 
2. We claim that h € K; i.e., that Ā is an acceptable function. This fol- 
lows fairly easily from the definition of / and the fact that it is a function. 
3. We claim that Å is defined throughout C. It suffices to show that 
the domain of Å is inductive. It is here that the assumption of freeness is 
used. For example, one case is the following: Suppose that x is in the do- 
main of Å. Then Å ; g(x), G(A(x))> is acceptable. (The freeness is required 
in showing that it is a function.) Consequently, g(x) is in the domain of A. 
4, We claim that’/ is unique. For given two such functions, let S be the 
set on which they agree. Then S is inductive, and so equals C. C] 


It is interesting to note that there is an alternative way of describing the 
proof of the recursion theorem, by presenting the desired function Å as 
the set (of pairs) generated from a set by some operations. For let 


0=Ux ¥V, 
B=h, a subset of U, 
P(x, u>, <y, >) = <S (x, y), Elu, »)>, 
G(<x, u>) = <g(x), GU). 


Thus Ê is the binary operation on U obtained as the product of the opera- 
tions f and F. Now let A be the subset of U generated from Ê by Ê and Ĝ. 
Then it can be checked that A has the desired properties. The freeness must 
be used in-showing that Å is a function. 

One final comment on induction and recursion: The induction principle 
we have stated is not the only one possible. It is entirely possible to give 
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proofs by induction (and definitions by recursion) on the length of expres- 
sions, the number of places at which connective symbols occur, etc. Such 
methods are inherently less basic but may be necessary in some situations. 


EXERCISES 


1. Suppose that C is generated from a set B = {a, b} by the binary 
operation fand unary operation g. List all the members of C}. How many 
members might C, have? C,? 


2. Obviously (A, —> A A,) is not a wff. But prove that it is not a wff. 


3. We can generalize the discussion in this section by requiring of F 
only that it be a class of relations on U. C, is defined as before, except that 
{Xo, X1s +++, Xp? is now a construction sequence provided that for each 
i<n we have either x;¢ B or <x;,,..., Xjps XD E R for some REP 
and some ji, ... , ją all less than 7. Give the correct definition of C* and 
show that C* = C}. 


§1.3 TRUTH ASSIGNMENTS 


We want to define what it means for one wff of our language to follow 
logically from other wffs. For example, A, should follow from (A, A A3). 
For no matter how the parameters A, and A, are translated back into En- 
glish, if the translation of (A, A A.) is true, then the translation of A, 
must be true. But the notion of all possible translations back into En- 
glish is unworkably vague. Luckily the spirit of this notion can be expressed 
in a simple and precise way. 

Fix once and for all a set {T, F} of truth values consisting of two distinct 
points: 

T, called truth, 
F, called falsity. 


(it makes no difference what these points themselves are; they might as 
well be the numbers 1 and 0.) Then a truth assignment v for a set S” of sen- 
tence symbols is a function 


v: Z —{T, F} 


assigning either T or F to each symbol in ™. These truth assignments will 
be used in place of the translations into English mentioned in the preceding 
paragraph. 
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(At this point we have committed ourselves to two-valued logic. It is 
also possible to study three-valued logic, in which case one has a set of 
three possible truth values. And then, of course, it is a small additional step 
to allow 512 or $, truth values; or to take as the set of truth values the unit 
interval [0,.1] or some other convenient space. A particularly interesting 
case is that for which the truth values form a complete Boolean. algebra. 
But it is two-valued logic that has always had the greatest significance, and 
we will be content to confine ourselves to this case.) 

Let he the set of wis generated from Z by the “ve formula-building 
operations. (F can also be characterized as the set ci wffs whose sentence 
symbols are all in Z; see the remarks at the end of the subsection on induc- 
tion in Section 1.2.) We want an extension Ÿ of v, 


p: Ž— (T, F} 


which assigns the correct truth value to each wff in È It should meet the 
following conditions: 


0. For any A € P, (A) = v(A). (Thus 7 is an extension of v.) 
For any o, $ in Ž 

T if wa) = F, 

F otherwise. 

T if Ha) =T and #(f) =T, 

F otherwise. 

T if We) = T or P8) = T (or both), 

4 otherwise. 


Lo =] 
2 ea B)) = | 


3. K((a v PY 
F if (a) = T and 0(f) = F, 
T otherwise. 
T if pœ) = PP), 


F otherwise. 


4. 1(0 > p) = 


5. Keep = 4 


(Conditions 1-5 are given in tabular form in Table III, It is at this point 
that the intended meaning of, for example, the conjunction symbol enters 
into our formal proceedings. Note especially the intended meaning of —. 
Whenever œ is assigned F, then (a — f) is considered “vacuously true” 
and is assigned the value T. For this and the other connectives, it is certainly 
possible to question how accurately we have reflected the common meaning 
in everyday speech of “if ..., then,” “or,” etc, But our ultimate concern 
lies more with mathematical statements than with the subtle nuances of 
everyday speech.) 
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TABLE Ol 
& B =g aap avg æ — B ap 
T T F T T T T 
Toy; F F F T F F 
F T T F T T F 
F F T F F T T 


As an example of the calculation of #, let œ be the wff 
((Az — (Ay — Ag)) <> ((Ag A A1) > Ag)) 


and let v be the truth assignment for {A,, As, Ag} such that 


v(Ay) =T, 
vA) = T, 
v(A,) = F. 


We want to compute f(a). We can look at the tree which displays the con- 
struction of æ: 


((A; —> (A; > Ag)) T. ((Ag A Ay) > Ag)) 


(A, > (A, —> Ag)) ((Ag A Ay) — Ag) 
F F 


Working from the bottom up, we can assign to each vertex f of the tree 
the value #(8). So as a first step we compute — 


H(A, > Ag)) = F and (A, A) = T. 


Next we compute 0((A, —> (A, > S = F, and so forth. Jia at the 
top of the tree we arrive at P(e) = 
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Actually this computation can be carried out with much less writing. 
First, the tree can be given in starker form: 


And even this can be compressed into a single line (with the parentheses 
restored): 
(CA, —> (Ay > Ag) + (Az A A1) > A6)). 
T FT FF T TTT FF 


Theorem 13A For any truth assignment v for a set 7 there is a unique 
function # : 7— {T, F} meeting the aforementioned conditions 0-5. 


This theorem follows from the recursion theorem of Section 1.2 and the 
unique readability theorem of Section 1.4. But it should already seem ex- 
tremely plausible, especially in light of the preceding example. In proving 
the existence of #, the crucial issue will, in effect, be the uniqueness of the 
trees mentioned in the example. 

We say that a truth assigment v satisfies p iff D(p) = T. (Of course for 
this to happen, the domain of v must contain all sentence symbols in @.) 
Now consider a set X of wffs (thought of as hypotheses) and another wif r 
(thought of as a possible conclusion). 


Definition 2 tautologically implies t (X Lz) iff every truth assignment 
for the sentence symbols in Æ and r which satisfies every member of Pa 
also satisfies t. 


This definition reflects our intuitive feeling that a conclusion follows 
from a set of hypotheses, if the assumption that the hypotheses are true 
guarantees that the conclusion is true. 
` Several special cases of the concept of tautological implication deserve 
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mention. First, take the special case in which 2 is the empty set Ø. Observe 
that it is vacuously true that any truth assignment satisfies every member 
of Ø. (How could this fail? Only if there was some unsatisfied member of 
Ø, which is absurd.) Hence we are left with: @ + iff every truth assignment 
(for the sentence symbols in v) satisfies r. In this case we say that + is a 
tautology (} t). For example, the wff ((A, —> (A, — Ag)) +> ((A; A Ay) 
~> A,y)) was in a recent example found to be satisfied by one of the eight 
possible truth assignments for {A,, Az, Ag}. In fact, it is satisfied by the 
other seven as well, and hence is a tautology. 

Another special case is that in which no truth assignment satisfies every 
member of X. Then for any v, it is vacuously true that X } r. For example, 


{A, (7A)} EB. 


There is no deep principle involved here; it is just a by-product of our 
definitions. l 


EXAMPLE {A, (A —>B)} EB. There are four truth assignments for 
{A, B}. It is easy to check that only one of these four satisfies both A 
and (A —> B), namely the v for which »(A) = v(B) = T. This v also sat- 
isfies B. 


If X'is singleton {o}, then we write “ø H t” in place of “{o} t.” If both 
of tandtt o, then o and v are said to be tautologically equivalent (o = 4 v). 
For example, in Section 1.0 we encountered the wffs (~(C v K)) and 
(CC) A (™K)) as alternative, translations of an English sentence. We 
can now assert that they are tautologically equivalent. 

We can state here a nontrivial fact whose proof will be given later (in 
Section 1.7). 


Compactness Theorem Let X be an infinite set of wffs such that for 
any finite subset 2) of X, there is a truth assignment which satisfies every 
member of Xo. Then there is a truth assignment which satisfies every 
member of 2. 


This theorem can be restated more simply as: If every finite subset of 
& is satisfiable, then 2 itself is satisfiable. (The reader familiar with some 
general topology should try to discover why this is called “compactness 
theorem”; it does assert the compactness of a certain topological space. 
He should then prove the theorem for himself, using Tychonoff’s theorem 
on product spaces:) 
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Truth tables 


There is a systematic procedure, which we will now illustrate, for check- 
ing, given wffs o1, ..., 0p, and t, whether or not 


og ONES 


In particular (when k = 0), the procedure will, given a wff, decide whether 
or not it is a tautology. 
As a first example, we can show that 


C(A a B)) F (A) v (=B)). 


To do this, we consider all truth assignments for {A, B}. There are four 
such assignments; in general there are 2” truth assignments for a set of n 
sentence symbols. The four can be listed in a table: 


A B 
T T 
T F 
F T 
F F 


This table can then be expanded to include (~(A a B)) and (œA) v (~B)). 
For each formula we compute the 7’s and F’s the way described before, 
writing the truth value under the correct connective (Table IV). (The 
two leftmost columns of Table IV are actually unnecessary.) We can 
now see from this table that all. those truth assignments satisfying 


. (-(A A B)), and there are three such, also satisfy ((7A) v (=B)). In 


fact, the converse holds also, and thus 


(CA 4 B)) E 4 (C4) v (mB). 


TABLE IV 
A B C(A A B)) (CA) V.(B)) 
T T F TTT FT F FT 
T E l T TFF FTTTF 
F T > P FFT TF T FT 
F F T FFF TET TF 
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To show that (—(A a B)) Æ (7A) A (œ8)) we can construct the table 
as before. But only one line of the table is needed to establish that there 
is indeed a truth assignment satisfying (—(A A B)) which fails to satisfy 
(œA) A (>B)). 

The more generally applicable a Bieedure it is, the less efficient it is 
likely to be. For example, to show that 


E((A y (BA C)) > (Av B) A (Av C))), 


we could apply the truth-table method. But this requires eight lines (for 
the eight possible truth assignments for {A, B, C}). With a little cleverness 
the tedium can be reduced: 


((A v (BA C)) e (A y B) a (A v C))). 
TT T TT TTT 
FFFF T FFFFF 
FTTTT T FITT FTT 


In the first line we assumed only that »(A) = T. Since that is enough in- 
formation to obtain T for the wff, we assume in all later lines that »(A) = 
In the second line we assume that v(B) == F; this again lets us obtain T 
for the wff. So we may limit ourselves to the case (B) = T. Since the expres- 
sion is symmetric in B and C, we may further suppose (C) = T. This 
leaves only the third line, whereupon we are done. 
As an example of the nonuse of a sixteen-line table, consider the follow- 
ing tautology: 
(((@P a Q) —= R) > S) > (P > R) => 5)). 
T TT 
F T FFT 
TTT RRRET TRRRFEF 


Here in the first line we dispose of the case where u(S) = T. In the second 
line we dispose of the case where either vP) = F or v(Q) = F. The third 
line incorporates the two remaining possibilities; here R is the truth value 
assigned to R and R is the opposite value. 

For the above example it is possible to see directly why it is a tautology. 
The stronger the hypothesis, the weaker the conditional. Thus 


(Pa QFP, 
(P >R) F (P AQ) R), 
(PB a Q) > R) > S) H (P — R) = 5). 
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The problem of developing effective procedures which reduce the tedium 
is important for theorem proving by machine. Some of the programs here 
may require testing wffs of sentential logic having thousands of sentence 
symbols. Truth tables are far too cumbersome for anything this large. 
The problem of developing highly efficient methods is a current area of 
research in computer science. 


A SELECTED LIST OF TAUTOLOGIES 


1. Associative and commutative laws for A, V, «=>. 
2. Distributive laws: 


(AA (B v C)) + (A AB) v (A $ C))). 
(A v (BA C)) e (Av B)a (Av O). 
3. Negation: 
(((=A)) < A). 
(((A —> B)) + (A A (=7B))). 
(((A + B)) +> ((A A (B)) v (œA) ^ B))). 


De Morgan’s laws: 


(CA a B)) + (CA) v (=B))). 
(CA y B)) + (mA) A (B))). 
4, Other 


Excluded middle: (A v (—A)). 


Contradiction: (aCA A (mA). 
Contraposition: ((A — B) <= ((4B) —> (mA))). 
Exportation: (A 4 B) > C) + (A> (B — ©))). 


EXERCISES 


1. Show that neither of the following two formulas tautologically 


implies the other: 


(A+ B © C)), 
((A 4 (BA C)) v (A) A (HB) A (~7C)))). 
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2. Is ((P — Q)-—> P)— P) a tautology? 
3. Show that . 


(a) Z; Ep iff XE (>), 
(b) a F +8 iff Fæ <> 8). 
(Recall that X; œ = Z U {o}.) 


4, Prove or refute each of the following assertions: 


(a) If either X Hæ or 2 £, then X E (a v p). 
(b) If X E (æ v p), then either X Hæ or YEP. 


5. (a) Show that if v; and n, are truth assignments which agree on all 
the sentence symbols in the wff æ, then 7,(«) = Pa). 

(b) Let & be a set of sentence symbols which includes those in X and 
t (and possibly more). Show that X H«r iff every truth assignment for. 7 
which satisfies every member of X also satisfies t. 


6. You are in a land inhabited by people who either always tell the 
truth or always tell falsehoods. You come to a fork in the road and you 
need to know which fork leads to the capital. There is a local resident 
there, but he has time-only to reply to one yes-or-no question. What one 
question should you ask so as to learn which fork to take? 


7. (Substitution) Consider a sequence oy, @,,... of wffs. For a wif 
p, lei p* be the result of replacing the sentence symbol A,, by @,, for n = 
eee 


(a) Let v be a truth assignment for the set of all sentence symbols; define 
. u to be the truth assignment for which u(A,) = i(a,). Show that ay) = 
bp"). 
(b) Show that if p is a tautology, then so is g*. 
8. (Duality) Let œ be a wif whose only connective symbols are A, v, 
and =. Let a* be the result of interchanging A and v and replacing each 


sentence symbol by its negation. Show that @* is tautologically equivalent 
to (70). 


9. Say that a set 2; of wffs is equivalent to a set X, of wffs iff for any 
wif a, 2 æ iff ZF æ. A set £ is independent iff no member of & is tautol- 
ogically implied by the remaining members in X. Show that 


(a) A finite set of wffs has an independent equivalent subset. 
(b) A infinite set need not have an independent equivalent subset. 


*(c) Let X = {o9, 01, ...}; show that there is an independent equivalent ` 


set 2", 
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10. Show that a truth assignment v satisfies the wif 
Cee (Ay <> Ay) > +++ <> A,) 


if v(A;) = F for an even number of i’s, | <i<an, 


11. There are three suspects for a murder: Adams, Brown, and Clark. 
Adams says “I didn’t do it. The victim was an old acquaintance of 
Brown’s. But Clark hated him.” Brown states “I didn’t do it. I didn’t even 
know the guy. Besides I was out of town all that week.” Clark says “I 
didn’t do it. I saw both Adams and Brown downtown with the victim that 
day; one of them must have done it.” Assume that the two innocent men 
are telling the truth, but that the guilty man might not be. Who did it? 


§1.4 UNIQUE READABILITY 


The purpose of this section is to prove that we have used enough paren- 
theses to eliminate any ambiguity in analyzing wffs. (The existence of the 
extension # of a truth assignment v will hinge on this lack of ambiguity.) 

It is instructive to consider the result of not having parentheses at all. 
The resulting ambiguity is illustrated by the wff 


A; V A, A Ag, 


which can be formed in two ways, corresponding to ((A; V A,) A Ag) 
and to (A, V (A, A Ag)). If v(Ay) = T and v(A;) = F, then there is an 
unresolvable conflict which arises in trying to compute f(A, v A, A Ag). 

We must show that with our parentheses this type of ambiguity does not 
arise but that on the contrary each wff is formed in a unique way. There is 
one sense in which this fact is unimportant: If it failed, we would simply 
change notation until it was true. For example, instead of building formulas 
by means of concatenation, we could have used ordered pairs and triples: 
<n, a, <a, A, PX, etc. (This is, in fact, a tidy, but untraditional, method.) 
The unique readability theorem would then be immediate. But we do not 
have to resort to this device, and we will now prove that we do not. 


Lemma 14A Every wff has the same number of left as right parentheses. 


Proof This was done as an example at the end of Section 1.1. E 


.1 If the reader has already accepted the existence of #, then he may omit almost all of 
this section. The final subsection, on omitting parentheses, should still be read. 
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Lemma 14B Any proper initial segment of a wff contains an excess of 
left parentheses. Thus no proper initial segment of a wff can itself be a wif. 


Proof We show that the set S of wffs possessing the desired property 
(that proper initial segments are left-heavy) is inductive. A wff consisting 
of a sentence symbol alone has no. proper initial segments and hence is 
in S vacuously. To verify that S is closed under 2, consider œ and £ in S. 
The proper initial segments are 


LC 

2. A where a, is a proper initial segment of œ. 

3. (a. 

4, (an. 

5, (œ A fy, where fy is a proper initial segment of f. 

6. (@ AaB. 
By applying the inductive hypothesis that œ and f are in S (in cases 2 and 5), 
we obtain the desired conclusion. fa 


Unique Readability Theorem The five formula-building operations, when 
‘restricted to the set of wffs, 

(a) have ranges which are disjoint from each other and from the set of 
sentence symbols, and 

(b) are one-to-one. 


In the language of Section 1.2, this asserts that the set of wffs is freely 
generated from the set of sentence symbols by the five operations, . 


Proof To show that the restriction of & is one-to-one, suppose that 
(œ a p) = (y A ò), 


where a, 6, y, and ô are wffs. Delete the first symbol of each sequence, 


obtaining 
æ a p) = y ad). 


Then we must have a = y, lest one be a proper initial segment of the other 
(in contradiction with the preceding lemma). And then it follows at once 
that B = ô. The same argument applies to 3, &,, and. &,; for 3, a 

simpler argument suffices. 
A similar line of reasoning tells us that the operations have disjoint 
ranges. For example, if ; 
(a AB) = (y > 9), 


1 
| 
| 
T 
i 
! 


i 
l 
| 
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where g, p, y, and 6 are wffs, then as in the above paragraph we have o = y. 
But that implies that A = —, contradicting the fact that our symbols are 
distinct. Hence 3, and &, (when restricted to wffs) have disjoint ranges. 
Similarly for any two binary connectives. 

The remaining cases are simple. If (~œ) = (8 A y), then $ begins with 
=, which no wff does. No sentence symbol is a sequence of symbols be- 
ginning with (. 


Now let us return to the question of extending a truth assignment v to #. 
First consider the special case where v is a truth assignment for the set of 
all sentence symbols. Then by applying the unique readability theorem 
and the recursion theorem (of Section 1.2) we conclude that there is a 
unique extension Ÿ to the set of all wffs with the desired properties. 

Next take the general case where v is a truth assignment for a set F of 
sentence symbols. The set F generated from F by the five formula-building 
operations is freely generated, as a consequence of the unique readability 
theorem, So by the recursion theorem there is a unique extension @ of v 
to that set, having the desired properties. 


An algorithm 


Our proof of the unique readability theorem can be converted from a 
proof-by-contradiction into an algorithm which, given a wff, will produce 
its unique family tree. The algorithm has the further advantage that if it 
is given an expression which is not a wff, it will detect that fact. 

Assume that we are given an expression. Initially it is the only vertex 
in the tree (and so is the minimum one), but as the procedure progresses 
the tree will grow downward from the given expression. 


1. If all minimal vertices have sentence symbols, then the procedure is 
completed. Otherwise, select a minimal vertex which has an expression which 
is not a sentence symbol. 

2. The first symbol must! be (. If the second symbol is the negation 
symbol, skip to step 4. 

3. Scan the expression from the left until first reaching (a, where œ is 
an expression having a balance between left and right parentheses.2 Then 
@ is the first constituent. The next symbol must! be A, v, —>, or +> and 
is the principal connective. The remainder of the expression, £), must! 


Tf not, then the original expression was not a wff. 
f ? If the end of the expression is reached before finding such ana, then the original expres- 
Sion is not a wf. 
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consist of an expression f and a right parenthesis. The second constituent is 
B. This completes the decomposition of selected expression; return to step 1. 
4. If the second symbol is the negation symbol, then that is the principal 
connective. The remainder of the expression, f), must! consist of an ex- 
pression $ and a right parenthesis. £ is the constituent. This completes the 
decomposition of the selected expression; return to step 1. 


Now for some comments about the algorithm. First we claim that given 
any expression, the procedure halts after a finite number of steps. This is 
because any vertex contains a shorter expression than the one above it, 
so the depth of the tree is bounded by the length of the given expression. 

Second, we should remark on the uniqueness of the procedure. For 
example, in step 3 we arrive at an expression a We could not use less than 
a for a constituent, for it would not have a balance between left and right 
parentheses, We could not use more than g, for that would have the proper 
initial segment o that was balanced. Thus is forced upon us. And then the 
choice of the principal connective is inevitable. 

It is clear that if our algorithm is given a wif, it will not use the foot- 
notes requiring the expression to be rejected. Conversely, suppose the 
expression given is such that the procedure does not reject it. Then, by 
working our way up the resulting tree, we discover inductively that every 
vertex has a wif, including the top vertex (which has the given expression). 

We can also use the tree to see how i(a) is obtained. For any wff « there 
is a unique tree constructing it. By working our way up this tree, we can 
unambiguously arrive at a value for (æ). 


Polish notation 


It is possible to avoid both ambiguity and parentheses. This can be done 
by a very simple device. Instead of, for example, (œ A B) we use Aap. 
Let the set of P-wffs be the set generated from the sentence symbols by 
the five operations 

DAa) = 70, Bia, P) = yap, 
Dar, B) = AaB, Bo, 8) TA a, 
Ba, P) = «aß. 


For example, one P-wff is 
—-AADV=B<CB. 


1 Jf not, then the original expression was not a wif. 


err OAR 
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Here the need for an algorithm to analyze the structure is quite apparent 
Even for the short example above, it requires some thought to see how it 
was built up. We will give a unique readability theorem for such expressions 
in Section 2.3. 

This way of writing formulas (but with N, K, A, C, and E in place of ~ 
A, y, —, and <+>, respectively) was introduced by the Polish logician aie 
siewicz. The notation is well suited to automatic processing, Computer 


compiler programs often begin by converting the formulas given them into 
Polish notation. 


Omitting parentheses 


Hereafter when naming wffs, we will not feel compelled to mention ex- 


plicitly every parenthesis. To establish a more compact notation, we now 
adopt the following conventions: 


1. The outermost parentheses need not be explicitly mentioned. For 
example, when we write “A A B” we are referring to (A a B). 

2; The negation symbol applies to as-little as possible. For example, 
=A A B is (4A) AB, i.e., (A) a B). It is not the same as (A A B)). 
3, The conjunction and disjunction symbols apply to as little as pos- 
sible, given that convention 2 is to be observed. For example, 


AAB-> Cv D is (Aa B) > ((C) v D). 
oy Where one connective symbol is used repeatedly, grouping is to the 
ardBarayisarn(Pay), 
a> By is a> (By). 
It must be admitted that these conventions violate what was said on page 


18 about naming expressions. We can get away with this only because we 
no longer have any interest in naming expressions which are not wffs. 


EXERCISES 


? L. Rewrite the tautologies in the “selected list” at the end of Section 1.3, 
ut using the conventions of the present section to minimize the number 
of parentheses. 


2. Give an example of wffs œ and $ and expressions y and 6 such that 
(@ AB) = (y a ô) but ay. 
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3. Suppose that we modify our definition of wff by omitting all right 
parentheses. Thus instead of 


((A a (=B)) > (C v D)) 


we use 
((A a (œB — (C y D. 


Show that we still have unique readability. Suggestion: These expressions 
have the same number of parentheses as connective symbols. 


4. The English language has a tendency to use two-part connectives: 
“both... and...,” “either... Or... 2) Sif oag then a. .” How does 
this affect unique readability in English? 


5, We have given an algorithm for analyzing a wif by constructing its 
tree from the top down. There are also ways of constructing the tree from 
the bottom up. This can be done by looking through the formula for inner- 
most pairs of parentheses. Give a complete description of an algorithm 
of this sort. 


§ 4.5 SENTENTIAL CONNECTIVES 


We have thus far employed five sentential connective symbols. Even in 
the absence of a general definition of “connective,” it is clear that the five 
familiar ones are not the only ones possible. Would we gain anything by 
adding more connectives to the language? Would we lose anything by 
omitting some we already have? 

In this section we make these questions precise and give some answers. 
First consider an informal example. We could expand the language by adding 
a three-place sentential connective symbol 7, called the’ majority symbol. 
We allow now as a wf the expression (faBy) wheriever œ, B, and y are 
wiis. In other words, we add a sixth formula-building operator to our list: 


Fo, B, y= (aby). 


Then we must give the interpretation of this symbol. That is, we must 
say how to compute I((g-aßy)), given the values pa), PE), and vy). We 
choose to define 


ö((żaßy)) is to agree with the majority of D(a), PB), Hy). 
We claim that this extension has gained us nothing, in the following precise 


sense: For any wif in the extended language, there is a tautologically equiv- 
alent wif in the original language. (On the other hand, the wif in the orig- 


| 
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a language may be much longer than the wff in the extended language.) 
Ne will prove this (in a more general situation) below; here we just note 
at it relies on the fact that (#4aBy) is tautologically equivalent to 


(œ a B) v (œ a y) Y BAY). 


(We note parenthetically that our insistence that #((#efy)) be calculable 
from <i(«), PP), H(y)> plays a definite role here. In everyday speech, there 
are unary operators like “it is possible that” or “I believe that.” We can 
ae sa these operators to a sentence, producing a new sentence whose 
ruth or falsity cannot be de i is 
oo A ca termined solely on the basis of the truth or 

In generalizing the foregoing example, the formal language will be 
more of a hindrance than a help. We can restate everything using onl 
functions. say that a k-place Boolean function is a function from tr rji 
into tT, F}. (A Boolean function is then anything which is a k-place Boeie 
function for some k. We stretch this slightly by permitting T and F he 
selves to O-place Boolean functions.) Some sample Boolean fun tic 
are defined by the equations (where X e {T, F} = 


TNs eh Xa) = Xi, 
NT)=F, MF)=T, 
K(T, T) =T,  K(F, X) = K(X, F) = F, 
AF, F) =F, A(T, X) = A(X, T) =T, 
C(T, F) =F,  C(F, X) = CŒ(X, T) =T, 
E(X, X) =T, BE SEE T) = F. 


From a wf œ we can extract a Boolean function. For example, if æ is 
the wff A, A A,, then we can make a table, Table V. The 2? lines of the 


table correspond to the 2? truth assignments for {A,, A,}. For each of the 


2 ipo V 
2? pairs ¥, we set BX) equal to the truth value œ receives when its sen- 
tence symbols are given the values indicated by Y. 


TABLE V 
Ay As Ay A Ag 
F F F BF, F) = F 
F T F BAF, T) = F 
T F F BAT, F) = F 
T T T BAT, T) = T 
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In general, suppose that œ is a wif whose sentence symbols are at most 
Ay, ..., An. We define an n-place Boolean function B (or just B, if n 
seems unnecessary), the Boolean function realized by a, by 


BY(X,, ...,X,) = the truth value given to œ when 
Ay, ..-,A, are given the values X,,..., Xn: 


Or, in other words, BU(X,, ..., Xn) = Pæ), where v is the truth assignment 
for {Ay,...,; An} for which v(A,) = X;. Thus Bz comes from looking at 
D(a) as a function of v, with æ fixed. 

For example, the Boolean functions listed previously are obtainable in 
this way: ; 
i = Bay 


N = BLA,» 

K= BA LAAs’ 
A= BiwAy> 
C = BY Ans 
E = Bisa,: 


From these functions we can compose others. For example, 
BZA ,v=A (Xi > X) = A(NT(S > X))} NIX ()))» 


(The right-hand side of this equation can be compared with the result of 
putting “A, Y “A, into Polish notation.) We will shortly come to the 
question whether every Boolean function is obtainable in this fashion, 

As the theorem below states, in shifting attention from wffs to the Boolean 
functions they realize, we have in effect identified tautologically equivalent 
wffs. Impose an ordering on {T, F} by defining F < T. (If T = 1 and F = 0, 
then this is the natural order.) 


Theorem 15A Leto and f be wffs whose sentence symbols are among 
Ay, ..+> Ay. Then * 

(a) |= 6 iff for all Xe {T, F}, BAX) < BAX). 

(b) a 48 iff B, = Bp. 

(c) Ha ifferan B, = {T}. 


Proof of (a) œ}ß iff for all 2% truth assignments v for Ay, ... 
whenever #(a) = T, then also #(6) = T. (This is true even if the sentence 
symbols in œ and 8 do not include all of A,,..., An; ef. Exercise 5 of 
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Section 1.3.) Thus 


at B iff for all 2” assignments, v, Ha) = T = 88) = T, 
iff for all 2” n-tuples ¥, BUX) = T= n(X) =T, 


iff for all 2" n-tuples ¥, BUY) < Bx(X), 
where F < T. 5 l 


In addition to identifying tautologically equivalent wffs, we have freed 
ourselves from the formal language. We are now at liberty to consider any 
Boolean function, whether it is realized by a wff or not. But this freedom is 
only apparent: . 


Theorem 15B Let G be an n-place Boolean function, n > 1. We can 
find a wff œ such that G = BY, i.e., such that œ realizes the function G. 


Proof Case I: ran G = {F}. Let a = A, A “iA. 
Case Il: Otherwise there are k points at which G has the value T, k > 0. 
List these: 
x, = (Xas Xr, -- 
X, = [Xas Aos + 


ia Xin? 
s$ Xon?» 


X, = (Xia Xia, ony Xin? 


Let 
py= |e! if Xy =T, 
` | (mA) if Xy= F, 
yi = Pa Atte A Bins 


a = Yi V Ya V tte V Yke 
We claim that G = Bè}. , 


At this point it might be helpful to consider a concrete example. Let G 
be the three-place Boolean function as follows: 


G(F, F, F) = F, 
G(F, F, T) = T, 
G(F, T, F) =T, 
G(F, T, T) = F, 
G(T, F, F) = T, 
G(T, F, T) = F, 
G(T, T, F) = F, 
G(T, T, T) =T. 
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Then the list of triples at which G assumes the value T has four members: 


FFT “=A, A 7A, A Ag, 
FTF =Å; A A, A Ag, 
TFF A, AA, A As, 
TIT Ay A Ag A Ag. 


To the right of each triple above is written the corresponding conjunction 
yı. Then « is the formula 


(7A, A mA, A A) V (A, A Ay A AQ) Vv (Ay A Ag A AS) V (AY A Ay A Ag). 


Notice how æ lists explicitly the triples at which G assumes the value T. 

To return to the proof of the theorem, note first that B(X) = T for 
1<i<k. (For the truth assignment corresponding to ¥; satisfies y; 
and hence satisfies œ.) On the other hand, only one truth assignment for 
l {A,,..., A,} can satisfy y;, whence only k such truth assignments can 
satisfy a. Hence B%(Y) = F for the 2” — k other n-tuples Y. Thus in all 
casés, Bu(Y) = G(Y). 


From this theorem we know that every Boolean function is realizable. 
Of course the æ which realizes G is not unique; any tautologically equivalent 
wff will also realize the same function. It is sometimes of interest to choose 
a to be as short as possible. (In the example done above, the wif 


Ay 4> A, A; 
also realizes G.) 

As a corollary to the above theorem, we may conclude that we have 
enough (in fact, more than enough) sentential connectives. For suppose that 
we expand the language by adding some exotic new sentential connectives 
(such as the majority connective discussed at the beginning of this section). 
Any wif » of this expanded language realizes a Boolean function B3. By 
the above theorem we have a wff œ of the original language such that 
B? = B}. Hence p and g are tautologically equivalent, by Theorem 15A. 

In fact, the proof shows that «œ can be of a rather special form. For one 
thing, the only sentential connective symbols in @ are A, v, and =., Fur- 
thermore, « is in so-called disjunctive normal form. That is, 


A= Yi V tee V Yrs 
where 
Yi = pan A Bin 
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and each fy is a sentence symbol or a negation of a sentence symbol. 
(The advantages of wffs in disjunctive normal form stem from the fact 
that they explicitly list the truth assignments satisfying the formula.) Thus 
we have ` 


Corollary 15C For any wff p, we can find a tautologically equivalent 
wif œ in disjunctive normal form. 


Because every function G : {T, F}" > {T, F} for n > 1 can be realized 
by a wif using only the connective symbols in {a, v, =}, we say that the 
set {A, Y, =} is complete. (Actually the completeness is more a property 
of the Boolean functions K, A, and N which correspond to these symbols. 
But the above terminology is convenient.) Once we have a complete set of 
connectives, we know that any wff is tautologically equivalent to one all 
of whose connectives are in that set. (For given any wif p, we can make 
œ using those connectives and realizing Bo- Then a +.) The complete- 
ness of {A, v, =} can be improved upon: 


Theorem ISD Both {—, a} and {~, v} are complete. 


Proof We must show that any Boolean function G can be realized by 
a wif using only, in the first case, {—, a}. We begin with a wff a using 
{—, A, v}, which realizes G. It suffices to find a tautologically equivalent 
a’ which uses only {~, A}. For this we use De Morgan’s law: 


BY YEA AB A my), 


By applying this repeatedly, we can completely eliminate v from g. 

(More formally, we can prove by induction on æ that there is a tauto- 
logically equivalent ' in which only the connectives A, ~ occur. Two cases 
in the inductive step are 


Case =: Ifa is (8), then let o be (=’). 
Case v: If a is (8 v y), then let a’ be —(—f' A =y’). Since f’ and y' 
are tautologically equivalent to 8 and y, respectively, 
w= CH Ah) 
F 4 (8 A my) 
FABvy 


== aM, 


In future proofs that a set of connectives is complete, this induction will 
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be omitted. Instead we will just give, for example, the method of simulating 
v by using ~ and A.) 


Showing that a certain set of connectives is not complete is usually more 
difficult than showing that one is complete. The basic method is first to 
show (usually by induction) that for any wff « using only those connectives, 
the function B? has some peculiarity, and second to show that some Boolean 
function lacks that peculiarity. 


EXAMPLE {A, >} is not complete. 


Proof The idea is that with these connectives, if the sentence symbols 
are assigned T, then the entire formula is assigned T. In particular, there is 
nothing tautologically equivalent to “À. 

In more detail, we can show by induction that for any wf œ using only 
these connectives and having A as its only sentence symbol, we bave AF a. 
(In terms of functions, this says that ¥ < BUX).) 


For each n there are 22” n-place Boolean functions. Hence if we identify 
a connective with its Boolean function (e.g., a with the function K men- 
tioned before), we have 22" n-ary connectives. We will now catalog these 


for n <2. 


O-ary connectives 


There are two 0-place Boolean functions, T and F. For the corresponding 
connective symbols we take T and L. Now an n-ary connective symbol 
combines with n wffs to produce a new wff. When n = 0, we have that 
Lis a wf all by itself. It differs from the sentence symbols in that #(L) = F 
for every v; i.e., L is a logical symbol always assigned the value F. Similarly, 
T is a wif, and #(T) = T for every v. Then, for example, A 1 is a wif, 
tautologically equivalent to “A, as can be seen from a two-line truth table. 


Unary comectives 


There are four unary connectives but only one of any interest. The in- 
teresting case is negation. The other three one-place Boolean functions 
are the identity function and the two constant functions. 


Binary connectives 


There are sixteen binary connectives, but only the last ten listed in Table 
VI are “really binary.” 


nso eee OO EER 
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TABLE VI 
Symbol Equivalent Remarks 
T two-place constant, essentially O-ary 
L two-place constant, essentially O-ary 
A projection, essentially unary 
B projection, essentially unary 
mA negation, essentially unary 
~B negation, essentially unary 
A AAB and; if T = 1 and F = 0, then this gives mul- 
tiplication in the field {0, 1} 
v AVB or 
_ A — B conditional 
«= A 4 B biconditional 
4 B A reversed conditional 


4- (A V B) A “(å AB) exclusive or, “A or B and not both”; if T= 1 
and F = 0, then this gives the usual addition 
(modulo 2) in the field {0, 1} 


4 “(AV B) i nor, “neither A nor B” 


| —(A A B) nand, “not both A and B”; the symbol is cal- 
led the Sheffer stroke 

< (MA) AB the usual ordering, where F < T 

> A A (7B) the usual ordering, where F < T 


Ternary connectives 


There are 256 ternary connectives; 2 are essentially O-ary, 6 (= 2 - (#)) are 
essentially unary, and 30 (= 10- @)) are essentially binary. This leaves 
218 which are really ternary. We have thus far mentioned only the majority 
connective #. There is, similarly, the minority connective. In Exercise 7 
we encounter +-%, ternary addition modulo 2. --°afy is assigned the value 
T iff an odd number of a, $, and y are assigned T. This formula is equivalent 
both toa +8 + y and to g <>f 4y. Another ternary connective arises 
in Exercise 8. ' 


EXAMPLE {|} and {} } are complete. 


Proof, for | 
mak dala 


av BE = (-0)| (A). 


- Since {~, v} is complete and ~=, v can be simulated using only I, {| } is 


complete. 2 
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EXAMPLE {=, =} is complete. In fact, of the ten connectives which are 


really binary, eight have the property of forming, when added to =, a 
complete set. The two exceptions are + and +»; see Exercise 6. 


Exampce {1,->} is complete. 


EXERCISES 


1, Let G be the following three-place Boolean function: 


G(F, F, F)=T, G(T, F, F)=T, 
GF, F, T)=T, GT, F, T)=F, 
G(F,T, F) =T, G(T,T,F)= F, 
GIET, T) =F, GU, T, T) = F. 


Find a wff, using at most the connectives v, A, and =, which realizes G. 
Then find such a wff in which connective symbols occur at not more than 
five places. 


2. Show that | and | are the only binary connectives which are complete 
by themselves. 


3. Show that {7, #} is not complete. 


4, Let M be the ternary minority connective. (Thus i#(MaBy) always 
disagrees with the majority of O(a), 0(6), and d(y).) Show that 

(a) {M, L} is complete. 

(b) {M} is not complete. 


5. Show that {T, L, =, 4>, +} is not complete. Suggestion: Show that 
any wff œ using these connectives and the sentence symbols A and B has 
an even number of T’s among the four possible values of i(a). 


6. Show that {A, +>, +} is complete but that no proper subset is com- 
plete. 


7. Let 4-3 be the ternary connective such that +-%aBy is equivalent to 
a+Bt+ y. 

(a) Show that {T, L, a, +°) is complete. 

(b) Show that no proper subset is complete. 


8. Let 1 be the ternary connective such that læfy is assigned the value 
T iff exactly one of the formulas g, p, y is assigned the value T, Show that 


| 
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there are no binary connectives o and a such that fæßy is equivalent to 
(ao B)ay. 


9, Add the 0-place connectives T, L to our language. For any wif » 
and sentence symbol A, let p4 be the wff obtained from by replacing A 
by T. Similarly for p4 Then let p4 = (p4 v p4). Show that 


(a) PE pk. 

(b) If pH y and A does not appear in y, then of E y. 

(c) (Interpolation theorem) If œ } £, then there is some y all of whose 
sentence symbols occur both in œ and in £ and such thata E y F 2. 


§1.6 SWITCHING CIRCUITS! 


Consider an electrical device (traditionally a black box) having n inputs 
and one output (Fig. 1). Assume that to each input we apply a signal having 
one of two values and that the output has one of two values. The two pos- . 
sible values we call T and F. (We could also define the F value as 0 potential 
and choose the unit of potential so that the T value has potential 1.) Further 


Xi -—— 
Xi F (Xi, Xo, X3) 
X3 ~ Jæ 

Figure 1. Electrical device with three inputs. 


assume that the device has no memory; i.e., the present output level depends 
only on the present inputs (and not on past history). Then the performance 
of the device is described by a Boolean function: 


F(X,, ..., Xp) = the output level given the input signals X,,..., Xp. 

Devices meeting all these assumptions constitute an essential part of 
digital-computer circuitry. There is, for example, the two-input AND gate, 
for which the output is the minimum of the inputs (where F < T). This 
device realizes the Boolean function K of the preceding section. It is con- 


venient to attach the labels A, and A, to the inputs and to label the output 


Ay A Ay. 


1 This section, which discusses an application of the ideas of previous sections, may 
be omitted without loss of continuity. 


54 l. SENTENTIAL LOGIC 

Similar devices can be made for other sentential connectives. For a two- 
input OR gate (Fig. 2) the output voltage is the maximum of the input 
voltages. Corresponding to the negation connective there is the NOT device 
(or inverter), whose output voltage is the opposite of the input voltage. 


Figure 2. OR gate. 


A circuit can be constructed from various devices of this sort. And it 
is again natural to use wffs of our formal language to label the voltages at 
different points (Fig. 3). Conversely, given the wff thus attached to the 
output, we can approximately reconstruct the circuit, which looks very 
much Jike the tree of the wif’s formation. 


Figure 3. Circuit with wfls as labels. 


For example, the circuit for 
(A A B) AD) v (A AB) a =C) 


would probably be as shown in Fig. 4. Duplication of the circuit for A a B 
would not usually be desirable. 


Tautologically equivalent wffs yield circuits having ultimately the same ; 


performance, although possibly at different cost and (if the devices are not 
quite instantaneous in operation) different speed. Define the delay of a cir- 
cuit as the maximum number of boxes through which the signal can pass 
in going from an input to the output. The corresponding notion for formulas 
is conveniently defined by recursion. 
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Figure 4, Circuit for (A A B) A D) V (CAAB)A =C). 


1. The delay of a sentence symbol is 0. 
2. The delay of ~æ is one greater than the delay of a. 
3. The delay of œ a B is one i 
greater than the maxi 
and the delay of £. Baa 


And similarly for any other connective. 


For example, the circuit of (A, A Ap) V Ay uses three devices and has 
a delay of 2. The tautologically equivalent formula “(Az A (MA, V ~A,)) 
gives a circuit having five devices and a delay of 4. The problem ae ans 
a computer engineer is: Given a circuit (or its wif), find an equivalent circuit 
(or a tautologically equivalent wff) for which the cost as a minimum, sub- 
ae to constraints such as a maximum allowable delay. For this probed: 
eee catalog of available devices; for example, he might have 


NOT, two-input AND, three-input OR. 


ut is desirable that the available devices correspond to a complete set of 
connectives.) The catalog of devices determines a formal language, having 
a connective symbol for each device. = 


EXAMPLE 1 Inputs: A, B, C. Output: To agree with the majority of 


A, B, and C. Devices available: two-input t o-inp it A D, One 
OR 
sol > WwW g f N i 


(AAB)v (AA C))v (BA C), 
Which uses five devices and has a delay of 3. But a better solution is 


(A 4 (Bv C)) y (BA C), 
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which uses four devices and has the same delay. Furthermore, there is 
no solution using only three devices; cf. Exercise 1. 
l A D 
EXAMPLE 2 Inputs: A and B. Output: T if the inputs agree, F otherwise; 
i.e., the circuit is to test for equality. Device available: two-input NOR. 
One solution is | c 
((A } A) } B) } (BB) A). | 
Sp E 


This uses five devices; is there a better solution? A deeper question is: Is | 
there a systematic procedure for finding a minimal solution? These are 
questions which we merely raise here. In recent years a great deal of work | 
has gone into investigating questions of this type. 


Figure 6. Bridge circuit. 


| F, T, T>, <F,T, F, F>, <F, T, F, T>, <F,T,T, FD, <F, T, T, T), and <T, F, 

EXAMPLE 3 (Relay switching) Inputs: A, “A, B; =B, ... . Devices: F, TX. G is to have the value F at <T, F, F, F), <T, F, T, F), <T, T, F, FD, 
OR (any number of inputs), AND (any number of inputs), Cost: Devices <T, T, T, FY, and <T, T, T, TY. At the remaining three points, (F, F, F, F), 
are free, but each use of an input costs one unit. To test for equality of A <T, F, T, TX, and <T, T, F, TY, we don’t care about the value of G. (The 
and B we could use „application of the circuit is such that these three combinations never 

occur.) 

We know that G can be realized by using, say, {A, v, œ}. But we want 
to do this in an efficient way. The first step is to represent the data in a 
more comprehensible form, We can do this by means of Fig. 7. Since 
G(F, F, F,T)= T, we have placed a T in the square with coordinates 
(~A, ~B, =C, D>. Similarly, there is an F in the square with coordinates 


(A a B) y (mA A =B). 


The wiring diagram for the circuit is shown in Fig. 5. The circuit will pass 
current iff A and B are assigned the same value. (This formula, equivalent 
to A <+> B, has the property that its truth value changes whenever the value 
of one argument changes. For this reason, the circuit is used, with double- 
throw switches, in wiring hallway lights.) 


A san Bme 


-=A -B 


Figure 5. Wiring diagram for (A A B) V (5A A ~B). 


But there is one respect in which relay circuits do not fit the description 
given at the beginning of this section. Relays are bilateral devices; they 
will pass current in either direction. This feature makes “bridge” circuits 
possible (Fig. 6). The methods described here do not apply to such circuits. 


ppw -O = ae Eas 


Figure 7. Diagram for Example 4. 


EXAMPLE4 There are four inputs, and the circuit is to realize the Boolean 
function G, where G is to have the value T at <F, F, F, T), <}, F, T, F > lE, 
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<A, B, ~C, ~D) because G(T, T, F, F) = F. The three squares we do not 


care about are left blank. 
Now we look for a simple geometrical pattern. The shaded area includes 
all T’s and no F’s. It corresponds to the formula 


(mA) v (œC a D), 


which is reasonably simple and meets all our requirements. Note that the 
input B is not needed at all. 


EXERCISES 


1. In Example 1 of this section, verify that there is no solution using only 
three devices. 


2, Define a literal to be a wff which is either a sentence symbol or the 
negation of a sentence symbol. An implicant of p is a conjunction of 
literals (using distinct sentence symbols) such that œ Fy. We showed in 
Section 1.5 (cf. Corollary 15C) that any satisfiable wif œ is tautologically 
equivalent to a disjunction a, V -++ væ., where each di is an implicant 
of p. An implicant œ of g is prime iff it ceases to be an implicant upon the 
deletion of any of its literals. Any disjunction of implicants equivalent to p 
clearly must, if it is to be of minimum length, consist only or prime im- 
plicants. 


(a) Find all prime implicants of 
(A => B) a (“A C). 
(b) Which disjunctions of prime implicants enjoy the property of being 
tautologically equivalent to the formula in part (a)? 


3. Repeat (a) and (b) of Exercise 2, but for the formula 
(Av =B) a (~C v D) > B a (Aad C) y (œC a D)). 


§ 1.77 COMPACTNESS AND EFFECTIVENESS 


Compactness 


We now give a proof of the compactness theorem mentioned earlier 
(Section 1.3). Call a set X of wffs satisfiable iff there is a truth assignment 
which satisfies every member of 2. 


MTN 
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Compactness Theorem A set of wffs is satisfiable iff every finite subset 
is satisfiable. 


Let us temporarily say that Z is finitely satisfiable iff every finite subset of 
2’ is satisfiable. Then the compactness theorem asserts that this notion 
coincides with satisfiability. Notice that if X is satisfiable, then automatically 
it is finitely satisfiable. Also if X is finite, then the converse is trivial. (Every 
set is a subset of itself.) The nontrivial part is to show that if an infinite 
set is finitely satisfiable, then it is satisfiable. 


Proof of the compactness theorem The proof consists of two distinct 
parts. In the first part we take our given finitely satisfiable set X and extend 
it to a maximal such set A. In the second part we utilize A to make a truth 
assignment which satisfies >. 

For the first part-let œ, œ, ... be a fixed enumeration of the wits. (This 
is possible since the set of sentence symbols, and hence the set of expressions, 


is countable; see Theorem OB.) Define by recursion (on the natural numbers) - 


0 = x, 
oe An 3 Ony if this is finitely satisfiable, 
mE VAR Spa otherwise. 


(Recall that A, 341 = An U {o41}.) Then each 4, is finitely satisfiable; 
see Exercise 1. Let A = (),4,, the limit of the 4,’s. 

It is clear that (1) Z S A and that (2) for any wf @ either a € A or 
(œw) € A. Furthermore, (3) A is finitely satisfiable. For‘any finite subset is 
already a finite subset of some A, and hence is satisfiable. 

This concludes the first part of the proof; we now have a set A having 
properties (1)~(3). There is in general not a unique such set, but there is 
at least one. (An alternative proof of the existence of such a A—and one 
that we can use even if there are uncountably many sentence symbols—em- 
ploys Zorn’s lemma. The reader familiar with uses-of Zorn’s lemma should 
perceive its applicability here.) 

For the second part of the proof we define a truth assignment v for the 
Set of all sentence symbols: 


WA)=T iff AEA 
for any sentence symbol A. Then for any wff p, we claim that 


v satisfies o Wf ped. 
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This is proved by induction on p; see Exercise 2. Since X S A, v must 
then satisfy every member of X. 3 


Corollary 17A. If Xt, then there is a finite 2 S X such that Qo ET. 
Proof We use the basic fact that X} + iff 2’; ~t is unsatisfiable. 


Qi ET for every finite 2, 2 
=> J ; œr is satisfiable for every finite Za S 2 
=X ; ~r is finitely satisfiable 
=> ; ~r is satisfiable 
= X% T. E 


In fact, the above corollary is equivalent to the compactness theorem; 
see Exercise 3. 


Effectiveness 


Although the method of truth tables is rather cumbersome to use, the 
existence of the method yields interesting theoretical conclusions. Suppose 
we ask of a set X of wffs whether or not there is an effective procedure which, 
given a wif r, will decide whether or not 2’ t. By an effective procedure we 
mean one meeting the following conditions: 


1. There must be exact instructions, finitely long, explaining how to 
execute the procedure. These instructions should demand no cleverness on 
the part of the person (or machine) following them. The idea is that your 
secretary (who knows no mathematics) or your computing machine (which 
does not think at all) should be able to execute the procedure by mechanically 
following the instructions. 

2. The procedure must avoid random devices (such as the flipping of a 
coin), or any such device which can, in practice, only be approximated. 

3. In the case of a decision procedure, as asked for above, the procedure 
must be such that, given a wif r, after a finite number of steps the procedure 
produces a “‘yes” or “no” answer. 


On the other hand, we place no bound in advance on the number of 
steps required. Nor do we place any advance bound on the amount of 
scratch paper that might be required. These will depend on, among other. 
things, the input r. But for any one t, the procedure is to require only a 
finite number of steps to produce the answer, and so only a finite amount 
of scratch paper will be consumed. 
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Of course the above description can hardly be considered a precise 
definition of the word “effective.” And, in fact, that word will be used only 
in an informal intuitive way throughout this book. (In Chapter 3 we will 
meet a precise counterpart, “recursive. ”) But as long as we restrict ourselves 
to positive assertions that there does exist an effective procedure of a certain 
sort, the intuitive approach suffices. We simply display the procedure, show 
that it works, and people will agree that it is effective. (But this relies on the 
empirical fact that procedures which appear effective to one mathematician 
also appear so to others.) If we wanted a negative result, that there did 
not exist an effective procedure of a certain sort, then this intuitive view- 
point would be inadequate. (In Chapter 3 we do want to obtain just such 
negative results.) Because the notion of effectiveness is intuitive, definitions 
and theorems involving it will be marked with a star. For example: 


*Theorem 17B There is an effective procedure which, given any expres- 
sion e, will decide whether or not it is a wf. 


Proof See the algorithm in Section 1.4 and the footnotes thereto. 


*Definition A set 2 of expressions is decidable iff there exists an effective 
procedure which, given an expression œ, will decide whether or not a € X. 


For example, any finite set is decidable. Some infinite sets are decidable 
but not all. For there are 28° sets of expressions but only countably many 
effective procedures. This is because the procedure is completely determined 
by its (finite) instructions. There are only N, finite sequences of letters. 


*Theorem 17C There is an effective procedure which, given a finite 
set X; r of wffs, will decide whether or not X Ft. 


Proof The truth-table procedure (Section 1.3) meets the requirement. 


In this theorem we specified that X ; t was finite, since one cannot be 
“given” in any direct and effective way all of an infinite object: 


*Corollary 17D For a finite set X, the set of tautological consequences of 
& is decidable. In particular, the set of tautologies is decidable. 


If X is an infinite decidable set, then in general its set of tautological 
consequences may not be, decidable. (See Chapter 3.) But we can obtain 
a weaker result, which is in a sense half of decidability. 

Say that a set A of expressions is effectively enumerable iff there is an 
effective procedure which lists, in some order, the members of A. If A is 
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infinite, then the procedure can never finish. But for any specified member 

of A, it must eventually (i.e., in a finite length of time) appear on the list. 
Primarily to give the reader more of a feeling for this notion, we now 

state two elementary results about it. 


*Theorem 17E A set A of expressions is effectively enumerable iff 


there is an effective procedure which, given any expression e, produces 


the answer “‘yes” iff ee A. 


(If « ¢ A, the procedure might produce the answer “no”; it might go on 
forever without producing any answer, but it must not produce the answer 


“‘ves,’’) 


Proof If A is effectively enumerable, then given any e we can examine 
the listing of A as our procedure churns it out. If and when e appears, we 
say “yes.” (Thus if e ¢ A, no answer is ever given. It is this that keeps 4 
from being decidable. When e has failed to occur among the first 10*° 
enumerated members of A, there is in general no way of knowing whether 
e ¢ A (in which case one should give up looking) or whether e will occur 
in the very next step.) 

Conversely, suppose that we have the procedure described in the theorem. 
We want to create a listing of A. The idea is to enumerate all expressions, 
and to apply our given procedure to each. But we must budget our time 
sensibly. It is easy enough to enumerate effectively all expressions: 


Els Egs Egy sre s 


Then proceed according to the following scheme: 


1. Spend one minute testing e, for membership in A (using the given 
procedure), 

2. Spend two minutes testing ¢,, then two minutes testing €z. 

3. Spend three minutes testing ¢,, three minutes testing e, and three 
minutes testing £z. 


And so forth. Of course whenever our procedure produces a “yes” 
answer, we put the accepted expression on the output list. Thus any member 
of A will eventually appear on the list. (It will appear infinitely many times, 
unless we modify the above instructions to check for duplication.) a 


*'Theorem 17F A set of expressions is decidable iff both it and its 
complement (relative to the set of all expressions) are effectively enu- 
merable. 


Se 
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-Proof Exercise 4. 


Observe that if sets A and B are effectively enumerable, then so are A U B 
and A N B. The class of decidable sets is also closed under union and 
intersection, and it is in addition closed under complementation. 

Now for a more substantive result: 


*Theorem 17G If is a decidable set of wffs, then the set of tautological 
consequences of X is effectively enumerable. 


Proof Actually it is enough for X to be effectively enumerated; consider 


an enumeration . 
Ois 02, Og, +++ 


Given any wff r, we can test (by truth tables) successively whether or not 
OEt, 
{o} H T, 
{01, 02} = T, 


{o,, 02, 03} a T, 


and so forth. If any of these conditions is met, then we answer “yes.” 
Otherwise, we keep trying. 

This does produce an affirmative answer whenever SE r, by the corollary 
to the compactness theorem. a 


EXERCISES 


1. Assume that every finite subset of Z is satisfiable. Show that the same 
is true of at least one of the sets © ;a and Z ; ~w. (This is part of the 
proof of the compactness theorem.) 


2. Let A be a set of wffs such that (i) every finite subset of A is satisfiable, 
and (ii) for every wff a, either œ € A or (~a) € A. Define the truth assign-, 
ment v: 

T if Aed, 
WA) =| if Ag A 


for each sentence symbol A. Show that for every wif p, Pp) = Tiff ọ € A, 
(This is part of the proof of the compactness theorem.) 
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3. Show that from the corollary to the compactness theorem we can prove 


the compactness theorem itself (far more easily than we can starting from 
scratch). 


4, Prove Theorem 17F. 


*5. The notions of decidability and effective enumerability can be ap- 
plied not only to sets of expressions but also to sets of integers or to sets of 
pairs of expressions or integers. Show that a set A of expressions is effectively 
enumerable iff there is a decidable set B of pairs <a, n> (consisting of an 
expression a and an integer n) such that A = dom B. 


6. Let X be an effectively enumerable set of wffs. Assume that for each 
wif t, either X t or XE ~rt. Show that the set of tautological consequences 
„of 2' is decidable. 


EEN 


CHAPTER TWO 


First-Order Logic 


§2.0 PRELIMINARY REMARKS 


In the preceding chapter we presented the first of our mathematical 
models of deductive thought. It was a simple model, indeed too simple. 
It is easy to think of examples of intuitively correct deductions which can- 
not be adequately mirrored in the model of sentential logic. l 

Suppose we begin with a collection of hypotheses (in English) and a 
possible conclusion. By translating everything to the language of sentential 
logic we obtain a set X of hypotheses and a possible conclusion t. Now if 
X} +, then we feel that the original English-language deduction was valid. 
But if X £ r, then we are unsure. It may well be that the model of sentential 
logic was just too crude to mirror the subtlety of the original deduction. 

In this chapter we present a system of logic of much greater ability. In 
fact, when the “working mathematician” finds a proof, he almost invari- 
ably means a proof that can be mirrored in the system of this chapter. 

First, we want to give an informal description of the features our first- 
order languages might have (or at least might be able to simulate). We 
begin with a special case, the first-order language for number theory. For 
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TABLE VII 


Formal expression ‘Intended translation 


` 


-0 “zero.” Here 0 is a constant symbol, intended to name 
the number 0, 


Sr “the successor of ¢.” Here S is a one-place function 
` symbol, ¢ is to be an expression which names some 

number a. Then Sf names S(a), the successor of a. 

For example, S0 is intended to name the number 1. 


LVP “v, is less than v.” Here < is a two-place predicate 
symbol. At the end of Section 2.1 we will adopt con- 
ventions letting us abbreviate the expression in the 
more usual style: v; < va. 


Vv “for every natural number.” The symbol V is the universal 
quantifier symbol. More generally, with each translation 
of the language into English there will be associated 
a certain set A (the so-called universe); V will then 
become “for every member of the universe A,” 


Vo, <0r, “For every natural number v, , zero is less than »,.” Or 
more euphoniously, “Every natural number is larger 
than 0.” This formal sentence is false in the intended 


translation, since zero is not larger than itself, 


this language there is a certain intended way of translating to and from 
English (Table VII). 

One abbreviation is mentioned in Table VII. There will be more (Table 
VID. 


TABLE VIM 


Abbreviated expression Fo Intended translation 


xaoy “x equals y” In unabbreviated form this will become 
IXY. 

dv “there exists a iatairal number v such that.” Or more 
generally, “there exists a member of the universe such 
that.” 


dy, Yrs 0, Fz na “There is exactly one natural number.” Again this formal 


sentence is false in the intended translation. 


Vv,(0 < vi V 0 vp) “Every natural number is greater than or equal to zero.” 
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Actually we will not be quite as generous as the tables might suggest. 


There are two economy measures that we can take to obtain simplification 


without any essential loss of linguistic expressiveness. 

First, we choose as our sentential connective symbols just = and -». 
We know from Section 1.5 that this is a | complete set, so there is no real 
reason to use more. 

Second, we forego the luxury of an existential quantifier, 4x. In its place 
we use: —Wx-. This is justified, since an English sentence, l 


There is something rotten in the state of Denmark, 
is equivalent to 
It is not the case that for every x, x is not wetted in the state of Denmark. 
Thus the formula Jv, Vv, vy = v, becomes, in unabbreviated from, 
(Vn (Vv, ~0402)). 


For an example in an ad hoc language, we might translate “Socrates 
is a man” as Hs, where H is a one-place predicate symbol intended to trans- 
late “is a man” and s is a constant symbol intended to name Socrates. 


` Similarly, to translate “Socrates is mortal” we take Ms. Then “All men are 


mortal” is translated as: Wo (Hv, > Mo). 

The reader will possibly recognize the symbols V and 3 from previous 
mathematical contexts. Indeed, some mathematicians, when writing on the 
blackboard during their lectures, already use a nearly formalized language 
with only vestigial traces of English. That our first-order languages resemble 
theirs:is no accident. We want to be able to take one step back and study 
not, e.g., sets or groups, but the sentences of set theory or group theory. 
(The term metamathematics is sometimes used; the word itself suggests 
the procedure of stepping back and examining what the mathematician is 
doing.) The objects you, the logician, now study are the sentences which you, 
the set theoretician, previously used in the study of. sets. This requires 
formalizing the language of set theory. And we want our formal lenguages 
to incorporate the features used in, for example, set theory. 


§2.1 FIRST-ORDER LANGUAGES 


We assume henceforth that we have been given infinitely many distinct 


_ Objects (which we call symbols), arranged as follows: 


: 
i 
f 
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A. Logical symbols 


0. Parentheses (, ). 
1. Sentential connective symbols: =>, =. 
2: Variables (one for each positive integer n): 


Vp, Pa, 000 
3. Equality symbol (optional): =~ 


B. Parameters 
0. Quantifier symbol: W. 
l. Predicate symbols: For each positive integer n, some set (possibly 
empty) of symbols, called -place. predicate symbols. 
2. Constant symbols: Some set (possibly empty) of symbols. 
3. Function symbols: For each positive integer n, some set (possibly 
empty) of symbols, called n-place function symbols. 


In A.3 we allow for the possibility of the equality symbol’s being pres- 
ent, but we do not assume its presence. Some languages will have it and 
others will not. The equality symbol is a two-place predicate symbol but is 
distinguished from the other two-place predicate symbols by being a logical 
symbol rather than a parameter. (This status will affect its behavior under 
translations into English.) We do assume that some n-place predicate symbol 
is present for some n. 

In B.2, the constant symbols are also called 0-place function symbols. 
This will often allow a uniform treatment of the symbols in B.2 and B.3. 

As before, we assume that the symbols are distinct and that no.symbol 
is a finite sequence of other symbols. 

In order to specify which language we have before us (as distinct from 
other first-order languages), we must (i) say whether or not the equality 
symbol is present, and (ii) say what the parameters are. 

We now list some examples of what this language might be: 


1. Pure predicate language 


Equality: No. 

n-place predicate symbols: A}, AZ, .... 
Constant symbols: a;, a,,.... 

n-place function symbols (n > 0): None. 


2. Language of set theory 
Equality: Yes (usually). 


l 
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Predicate parameters: One two-place predicate symbol e€. 
Function symbols: None (or occasionally a constant symbol @). 


3. Language of elementary number theory (as in Chapter 3) 
Equality: Yes. . 
Predicate parameters: One two-place predicate symbol <. 
Constant symbols: The symbol 0. 
One- place function symbols: S (for successor). 
Two-place function symbols: + (for addition), - 
and Æ (for exponentiation). 


(for multiplication), 


In examples 2 and 3 there are certain intended translations of the param- 
eters. We will presently give a number of examples of sentences that can 
be translated into these languages and a few examples of sentences that 
cannot be so translated. . 

It is important to notice that our notion of language includes the language 
for set theory. For it is generally agreed that, by and large, mathematics 
can be embedded into set theory. By this is meant that l 


(a) statements in mathematics (like the fundamental theorem of calculus) 
can be expressed in the language of set theory; and 

(b) the theorems of mathematics follow logically from the axioms of 
set theory, 


Our model of first-order logic is fully adequate to mirror this procedure. 


ExamrLes in the language of set theory Here it is intended that V 
should mean “for all sets” and € should mean “is a member of,” 


L “There is no set of which every set is a member.” We will translate 
this into the language of set theory using several steps. The intermediate 
sentences are neither in English nor in the formal language but are in a 
mixed language. 


™[There is a set of which every set is a member] 
“dv,[Every set is a member of »,] 
“11d, Vg v, E€ o 


Although it is tempting to stop here, we must now replace v, € v, by EP304, 
since predicate symbols will always go at the left in such contexts. Fur- 
thermore, Jv; must be replaced by “Voy, as mentioned earlier. And we 
Must use the correct number of parentheses. The finished product is 


(=(AV0,(“V 0, €v,0,))). 
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2. Pair-set axiom: “For any two sets, there is a set whose members 
are exactly the two given sets.” Again we carry out the translation in 
stages. 

Vv,Vv,[There is a set whose members are exactly v, and vəl, 

Vv, Vv, Avg{The members of v are exactly v, and vl, 

Wo Vv, Av, Voal Ya E Vg > 04 = V V Vy Va). 

Now we replace Iv, by “Wr, t4 E V by €ryv,, and vy = v; by =V. 
In addition, we must eliminate +> and v in favor of our chosen con- 
nectives — and ~, Thus 


av B becomes =w —> ĝ; 


a«»f becomes —((a —> B) > —(B = @)). 


The finished product is 


Wo V0,( AV 93(V 14( (Eras —> (m = vw) > 


A2V40q)) —> (=w) > 22040) —> EV45)))))): 


The finished product is not as pleasant to read as the version that preceded 
it. As we have no interest in deliberately making life unpleasant for ourselves, 
we will eventually adopt conventions allowing us to avoid seeing the finished 
product at all. But for the moment it should be regarded as an interesting, 
even if unattractive, novelty. 


EXAMPLES in the language of elementary number theory Here it is in- 
tended that V should mean “for all natural numbers” and that <, 0, S, 
+, °, and E should have the obvious meanings. 


1. As a name for the natural number 2 we have ‘the term SSO, since 2 
is the successor of the successor of zero. Similarly, for 4 we have the term 
SSSS0. For the phrase “2 + 2” it is tempting to use SS0 + SSO. But we 
will adopt the policy of always putting the function symbol at the left 
(i.e., we will use Polish notation for function symbols). Thus corresponding 
to the English phrase “2 + 2” we have the term + SSO SSO. The English 
sentence “Two plus two is four” is translated as 


= -+ SSO S50 SSSSO. 


(The spaces are inserted to help the reader, but they do not constitute an _ 


official feature of the language.) 


ENPTA AEA TTS 


2.1 First-Order Languages 71 


sé i s 
2. “Any nonzero natural number is the successor of some number.” 
We will perform the translation in three steps. l 


Yv [If v, is nonzero, then », is the successor of some number.) 
Yolo A 0 => Iv, v ~ Sr). 
Vol ~0,0) + (Vv (m= Sv))). 


3. “Any nonempty set of natural numbers has a least element.” T his 
cannot be translated into our language, because we cannot express “any... 
set.” This requires either something like the (first-order) language for set 
theory or a second-order language for number theory. We could, however, 
translate, “The set of primes has a least element.” (The first step is to convert 
this sentence into, “There is a smallest prime.” We leave the other steps to 
the reader; hints can be found in the next section.) 


EXAMPLES in ad hoc languages 
1. “All apples are bad.” 

Wv,(Av, — Bo). 
2. “Some apples are bad.” 


Intermediate step: dv,(Av, A Boi). 


. Finished product: (=Vv,(—(-(A», + (—B»,))))). 


‘These two examples illustrate patterns which arise continually. An En- 
glish sentence which asserts that everything in a certain category has some 
property, is translated 


Vo(_ ). 
A sentence which asserts that there is some object or objects in the category 
and having the property, is translated 


IvA). 


The reader should be cautioned against confusing the two patterns. 
For example, 
W0,(Av, A Bv) 


translates “Everything is an apple and is bad,” which is a much stronger 
assertion than the sentence in the first example. Similarly, 3v,(Av, —> Br) 
translates “There is something which is bad, if it is an apple.” This is a much 
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weaker assertion than the sentence in the second example. It is true (vac- 
uously) even if all apples are good, provided only that the world has 
something which is not an apple. 


3. Bobby’s father can beat up the father of any other kid on the block. 
Establish a language in which V is intended to mean “for all people,” Kx 
is to mean “x is a kid on the block,” b is to mean “Bobby,” Bxy is to mean 
“x can beat up y,” and fx is to mean “‘the father of x.” Then a translation 
is 


Vo (Ko, — (=b) + Bfbfv,)). 
Formulas 


An expression is any finite sequence of symbols. Of course most expres- 
sions are nonsensical, but there are certain interesting expressions: the 
terms and the wffs. 


atomic other all other 
terms ; 
formulas wffs expressions 
ae 5 = 


The terms are the nouns and pronouns of our language; they are the 
expressions which can be interpreted as naming an object. The atomic 
formulas will be those wffs which have neither connective nor quantifier 
symbols. l 

The terms are defined to be those expressions which can be built up from 
the coństant symbols and the variables by prefixing the function symbols. 
To restate this in the terminology of Section 1.2, we define for each n-place 
function symbol f, an n-place term-building operation @; on expressions: 


Piles «+5 En) = fey + Ey. 


Definition The set of terms is the set of expressions generated from the 
constant symbols and variables by the @ operations. 


If there are no function symbols, then the terms are just the constant 
symbols and the variables. In this case we do not need an inductive 
definition. 

Notice that we use Polish notation for terms by placing the function 
symbol at the left. The terms do not contain parentheses or commas. We 
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will later prove a unique readability result, showing that the set of terms is 
freely generated. 

The terms are the expressions whieh: are translated as names of objects 
(noun phrases), in contrast to the wffs which are translated as assertions 
about objects. i l 

Some examples of terms in the language of number theory are 


+ v, S0, 
SSSSO, 
-+ E v, SS0 E v, SSO. 


The atomic formulas will play a role roughly analogous to that played 
by the sentence symbols in sentential logic. An atomic formula is an expres- 


sion of the form 
Pt, aa ta 5 


where P is an n-place predicate symbol and f, ..., ¢, are terms. 

For example, ~v,v, is an atomic formula, since ~ is a two-place pred- 
icate symbol and each variable is a term. In the language of set theory 
we have the atomic formula €9,v,. 

Notice that the atomic formulas are not defined inductively. Instead we 
have simply said explicitly just which expressions are atomic formulas. 

The well-formed formulas are those expressions which can be built up 
from the atomic formulas by use of the connective symbols and the quantifier 
symbol. We can restate this in the terminology of Section 1.2 by first defin- 
ing some formula-building operations on expressions: 


Zy) = (my), 
&(y, ô) = (y + 9), 
2y) = Wri y. 


Definition The set of well-formed formulas (wffs, or just formulas) is 
the set of expressions generated from the atomic formulas by the operations 


Za, ®,, and 2i = 1,2, ...). 
For example, ~w; is not a wif. (Why?) On the other hand, » 


Vv, (( AV 05(E 030;)) > (Vv Enp > 
(AV n (Ev > (7E%40)))))) 
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is a wff, as is demonstrated by the following tree: 


Vv, 
| 
Fegi 
N 
y a 
i s3 
|. | 
Yos Yv, 
| | 
= => 
| xox 
EVs, EVV = 
| 
Vy 


=p 
a 
€ VaV a at 


Evy 


But it requires some study to discover that this wff is the axiom of regularity: 


for set theory. 


Free variables 


Two examples of wffs are Vv, Evy, and (mYr, (Vv, Evw)). But there 
is an important difference between the two examples. The second «might 
be a translation from English of 


There is a set such that every set is a member of it. 


The first example, however, can only be a translation of an incomplete 
sentence, such as 


Every set is a member of wee: 
We are unable to complete the sentence without knowing what to do with »,. 


In cases of this sort, we will say that v, occurs free in the wif Vv, Ev. In 
contrast, no variable occurs free in (7Vv,(7Vv, Evw,)). But of course 
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we need a precise definition which does not refer to possible translations to 
English but refers only to the symbols themselves. 

Consider any variable x. We define, for each wff.a, what it means for x 
to occur free in œ. This we do by recursion: 


1. For atomic œ, x occurs free in œ iff x occurs in (is a symbol of) a. 
2. x occurs free in (a) iff x occurs free in œ. 

3. x occurs free in (æ — B) iff x occurs free in @ or in £. 

4. x occurs free in Vv, iff x occurs free in œ and x Æ vi. 


This definition makes tacit use of the recursion theorem. We can restate 
the situation in terms of functions. We begin with the function h defined on 


-atomic formulas: 


KE l if x is in the atomic formula a, 
O10 otherwise. 
And we want to extend A to a function Å defined on all wffs in such a way 
that - 
AE) = h(a), 


h(Z,(a, P) = max{h(a), APY, 


KANN ee 


Then we say that x occurs free in œ iff h(a) = 1. The existence of a unique 
such Å (and hence the meaningfulness of our definition) follows from the 
recursion theorem and from the fact (proved in Section 2.3) that the wffs 
are freely generated. 

If no variable occurs free in the wif œ, then « is a sentence. (The sentences 
are intuitively the wffs translatable into English without blanks, once we 
are told how to interpret the parameters.) l 

For example, Vv,(Av, —> Bv,) and Wvs (Pvs — Wv, Qv) are sentences, but 
», occurs free in (Wv, Av, > Bv,). The sentences are usually the most in- 
teresting wffs. The others lead a second-class existence; they are used pri- 
marily as building blocks for sentences. 

In translating a sentence from English, the choice of particular variables is 
unimportant. We earlier translated “All apples are bad” as Yy, (Av, —> By). 


“We could equally well have used 


Vvo (Ava —> Bva). 


‘The variable is, in effect, used as a pronoun, just as in English we might say, 
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“For any object whatsoever, if if is an apple, then if is bad.” Since the 
choice of particular variables is unimportant, we will often not even specify 
the choice. Instead we will write, for example, Vx(Ax — Bx), where it is 
understood that x is some variable. (The unimportance of the choice of 
variable will eventually become a theorem.) 

Similar usages of variables occur elsewhere in mathematics. In 


7 
È, dy 
iis a “dummy” variable but j occurs free. 


On notation 


We can specify a wff (or indeed any expression) by writing a line which 
displays explicity every symbol. For example, 


V0,((4~,0) = (~Vv,(-=9,Sp,))). 


But this way of writing things, while splendidly complete, may not be readily 
comprehensible. The incomprehensibility can be blamed (in part) on the 
simplifications we wanted in the language (such as the lack of an existential 
quantifier symbol). We naturally want to have our cake and eat it too, so 
we now will agree on methods of specifying wffs in more indirect but more 
readable ways. These conventions will let us write a line such as 


Volo A 0 > Av, v, ~ Sv) 


to name the same wff as is named by the other line above. 

Note well that we are not changing our definition of what a wif is. We 
are just conspiring to fix certain ways of naming wffs. In the (rare) cases 
where the exact sequence of symbols becomes important, we way have to 
drop these new conventions and revert to primitive notation. 

We adopt then the following abbreviations and conventions. Here g and 
B are formulas, x is a variable, and u and ¢ are terms. 


(a v p) abbreviates ((—0:) — £). 
(a a B) abbreviates (a(a@ => (—f))). 
(œ <=> B) abbreviates ((x —> f) a (B > @)); i.e., 


(e > £) > E > @)))). 


dxa abbreviates (~Vx(—a)). 


' 
| 
| 
| 
| 
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u =~ t abbreviates ~ut (and similarly for other two-place predicate 
symbols). l 

unk t abbreviates (aut); similarly u £ t abbreviates (~<ut). 

For parentheses we will use not only (and ) but also [ and ], ete. And 
we omit mention of just as many as we possibly can. Toward that end we 
adopt the following conventions: 

1. Outermost parentheses may be dropped. For example, Vx a <-> p. is 
(Yxa —> f). s 

2. =, V, and 3 apply to as little as possible. For example, 


saAB is (ma) ap), and not a(@ A f); 
Vxa-»B is (Vxa— f), and not Vx(o- A); 
axa ap is (xa B), and not Ax(a a £). 


In such cases we might even add gratuitous parentheses, as in (Ax æ) A £. 
3, a and v apply to as little as possible, subject to item 2. For example, 


~ A B -> y is (a) a p) => y. 


4. When one connective is used repeatedly, the expression is grouped to 


_ the right. For example, 


œa —>ß — y is œ => (B = y). 


EXAMPLES of how we can eliminate abbreviations, rewriting the formula 


- in an unabbreviated way that explicitly lists each symbol in order: 


1. Sx(Ax A Bx) is (AV x(a( (Ax > (-Bx))))). 

But (—=Vx(Ax -> (>Bx))) would be an equivalent formula (in any rea- 
sonable notion of equivalence). 

2. Ax Ax—> Bx is ((7AVx(7Ax)) — Bx). 

Jx(Ax — Bx) is (aVx(-(Ax — Bx))). 

We will try to use the various alphabets’ in a systematic way. The 
system is listed below, but there will be occasional exceptions for special 
reasons. 
` Predicate symbols: Uppercase italic letters. Also €, <. 

Variables: v;, U,V, X, Y, Z. 

Function symbols: f, g, h. Also S, +, ete. 

Constant symbols: a, b, o... Also 0. 

Terms: u, t. 
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: 

Formulas: Lowercase Greek letters, | 

Sentences: og, T. | 
Sets of formulas: Uppercase Greek letters, plus certain italic 

which pretend to be Greek, viz., A (alpha) and T (tau). | 

Structures: Uppercase German letters. | 


letters 


EXERCISES 


1. Assume that we have a language with the following parameters: 
Y, intended to mean “for all things”; N, intended to mean “is a number”; 
I, intended to mean “is interesting”; <, intended to mean “is less than”; 
and 0, a constant symbol intended to denote zero. Translate into this lan- 
guage the English sentences listed below. If the English sentence is am- 
biguous, you will need more than one translation. 

(a) Zero is less than any number. 

(b) If any number is interesting, then zero is interesting. 

(c) No number is less than zero. l 

(d) Any uninteresting number with the property that all smaller numbers 
are interesting certainly is interesting. 

(e) There is no number such that all numbers are less than it. 

(f) There is no number such that no number is less than it. 


specified, Make full use of the notational conventions and abbreviations to 
make the end result as readable as possible. 


2. Neither a nor b is a member of every set. (W, for all sets; €, is a member 


| 
In 2-6, translate each English sentence into the first-order language | 
| 
of; a, a; b, b.) 


| 
3. If horses are animals, then heads of horses are heads of animals. (Y, | 
for all things; Æ, is a horse; A, is an animal; hx, the head of x or (if x is 
headless) x itself.) 


4, (a) You can fool some of the people all of the time. (b) You can fool 
all of the people some of the time. (c) You can’t fool all of the people all of 
the time. (V, for all things; P, is a person; T, is a time; Fxy, you can fool 
x at y. One or more of the above may be ambiguous, in which case you 
will need more than one translation.) 


5. (a) Adams can’t do every job right. (b) Adams can’t do any job right. 
(V, for all things; J, is a job; a, Adams; Dxy, x can do y right.) 


6. Nobody likes everybody. (V, for all people; Lxy, x likes y.) 
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7. Give a precise definition of what it means for the variable x to occur 
free as the ith symbol in the wff œ. (If x is the ith symbol of œ but does not 
occur free there, then it is said to occur bound there.) 


8. Rewrite each of the following wffs in a way which explicitly lists each 
symbol in the actual order: 

(a) dv, Po, A Pr. 

(b) Yv, Av, A Bo, > dv, Cv, V Dvs. 


In each case, say which variables occur free in the wff. 


§2.2 TRUTH AND MODELS 


In sentential logic we had truth assignments to tell us which sentence 
symbols were to be interpreted as being true and which as false. In first- 
order logic the analogous role is played by structures, which can be thought 
of as providing translations from the formal language into English. (Struc- 
tures are sometimes called interpretations, but we prefer to reserve that 
word for another concept, to be encountered in Section 2.7.) 

A structure for a first-order language will tell us 

1. What collection of things the universal quantifier symbol (Y) refers 
to, and . 

2, What the other parameters (the predicate and function symbols) 
denote. 


Formally, a structure Wt for our given first-order language Is a function 
whose domain is the set of parameters and such that 


1. Ñ assigns to the quantifier symbol V a nonempty set |A |, called the 
universe of YX. ; 

2. ÙX assigns to each x-place predicate symbol P an n-ary relation 
P% < | Y |"; ie., P% is a set of n-tuples of members of the universe. 

3. A assigns to each constant symbol ¢ a member c™ of the universe | U |: 

4. Ù assigns to each n-place function symbol f an n-ary operation f 
on |Y |; ie, £% >| U [e — | WI. l 

The idea is that Ù assigns meaning to the parameters. V is to mean “for 
everything in | Ñ |.” The symbol c is to name the point c%. The atomic 
formula Ph - ++ t, is to mean that the n-tuple of points named by hiyas l 
is in the relation P%. (We will shortly restate these conditions more care- 
fully.) 


, by 
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Note that we require the universe | Ñ | to be nonempty. Notice also that 
f% must have all of | Y |” for its domain; we have made no provision for 
partially-defined functions. 


EXAMPLE Consider the language for set theory, whose only parameter 
(other than VW) is €. Take the structure U with 


|A| = the set of natural numbers, 
e% = the set of pairs <m, ny such that m <n. 


(Thus we translate € as “less than.’’) In the presence of a structure we can 

translate sentences from the formal language into English and attempt 

to say whether these translations are true or false. The sentence of this first- 

order language 
dx Vy my ex 


‘(or more formally, (4Vv,(—=V»,(—76E»,9,)))), which under another transla- : 
tion asserts the existence of an empty set, is now translated under Ù into | 


There is a natural number such that no 
natural number is smaller, 


which is true. Because of this we will say that 3x Vy ~y e x is true in Ñ, 
or that Y is a model of the sentence. On the other hand, Wf is not a model 
of the pair-set axiom, 


Vx Vy Iz VIÇ e z4 t= xy t= y), 


as the translation of this sentence under % is false. For there is no natural 
number m such that for every n, 


n<m if a= 1. 


(The reader familiar with axiomatic set theory can check that Y is a model 
of the extensionality axiom, the union axiom, and the axiom of regularity.) 


In the above example it was intuitively pretty clear that certain sentences 
of the formal language were true in the structure and some were false. But 
we want a precise mathematical definition of “ø is true in Y.” This should 
be stated in set-theoretic terms, without employing translations into English 
or a supposed criterion for asserting that some English sentences are true 
while the others are false. (If you think you have such a criterion, try it 
on the sentence ““This sentence is false.’’) In other words, we want to take 
our intuitive notion of “ø is true in W’ and make it part of mathematics. 
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In order to define “o is true in Y,” 
Fad, 
for sentences o and structures Ñ, we will find it desirable first to define a 
more general notion involving wffs. Let 


y be a wif of our language, 

a structure for the language, 

s:V—>|%X| a function from the set V of all variables into the uni- 
verse |A| of A. 


Then we will define what it means for Ù to satisfy p with s, 


Fa p [s]. 


The intuitive version is 


Hy @ [s] if and only if the translation of p determined by N, where the 
variable x is translated as s(x) wherever it occurs free, is true. 


The formal definition of satisfaction proceeds as follows: 
I. Terms. We define the extension 
§:T> |X, 
a function from the set T of all terms into the universe of Y. The idea is 


that 5(#) should be the member of the universe |Ñ | that is named by the 
term f. § is defined by recursion as follows: 


l. For each variable x, 5(x) = s(x). 
2, For each constant symbol c, 5(c) = c™. . 
3. If tj, ...5¢, are terms and f is an n-place function symbol, then 


aft, Stee ta) = fU), ter’ S(ta)). 


A commutative diagram, for n = 1, is 


rT |t] 
“|, |r 
T—— |X| 


The existence of a unique such extension § of s follows from the recursion 
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theorem (Section 1.2), by using the fact that the terms are freely generated 


(Section 2.3). Notice that depends both on s and on , 


II. Atomic formulas. The atomic formulas were defined explicitly, not 
inductively. The definition of satisfaction of atomic formulas is therefore 
also explicit, and not recursive. 


1. Fy tf, [s] iff S0) = Elt). 
(Thus ~ means =. Note that = is a logical symbol, not a parameter open 
to interpretation.) 

2. For an m-place predicate parameter P, 


Ea Pt, des bn [s] iff <5(t); ee 5(t,)> € Pp% 


MI. Other wfs. The wffs we defined inductively, and consequently here 
satisfaction is defined recursively. 


1. For atomic formulas, the definition is above. 
2. Fy =p [s] iff Fy o [s]. 
3. Fy (p> y) [s] iff either Éy g [s] or Eyy [s] or both. 
(In other words, if Ñ satisfies » with s then Y satisfies y with s.) 
4. Ey Wx o [s] iff for every de |A], we have Ey — [s(x | d). 


Here s(x|d) is the function which is exactly like s except for one thing: 

At the variable x it assumes the value d. This can be expressed by the equa- 
tion: 

sœ) ify A x, 

Ho ig oe 


(Thus Y means “for all things in |Ù |.”) 


At this point the reader might want to reconsider the intuitive. version 
of Fy p [s] on page 81 and observe how it has been formalized. We should 
also remark that the definition of satisfaction is another application of the 
recursion theorem together with the fact that the wffs are freely generated. 
The definition can be restated in terms of functions to make it clearer how 
the recursion theorem of Section 1.2 applies: \ 

(i) Consider one fixed Ù. 

(ii) Define a function Å (extending a function A defined on atomic for- 
mulas) such that for any wf p, /(p) is a set of functions from V into |% |. 

(iii) Define 

Fuapls] iff seko). 


2.2 Truth and Models 83 


We leave to the reader the exercise of writing down the explicit definition 
of h and the clauses which uniquely determine its extension Å. (See Exercise 
7.) An elegant alternative is to have A(p) be a set of functions on the set of 
those variables which occur free in ọ. 


Definition Let I be a set wffs, » a wff. Then I" logically implies 9, 


pe iff for every structure Ñ for the language and every function’ 


:V— |A| such that % satisfies every member of J’ with s, WU also 
satibes p with s. 


We use the same symbol, “H,” that was used in Chaper | for tautological 
implication. But henceforth it will be used solely for logical implication. 
As before we will write “y H g” in place of “{y} F p.” Say that p and y 
are logically equivalent (y | 4y) iff p Hy and y Eg. 

The first-order analog of the tautologies are the valid formulas. A wff 
p is valid iff Ø | g (written just “H g”). Thus @ is valid iff for every Ñ and 
every 5: V — |A|, A satisfies p with s. 

At this point we pause to verify that when we want to know whether or 
not a structure Ñ satisfies a wff m with s, we don’t really need all of the (in- 
finite amount of) information s gives us. All that matter are the values of the 
functions s at the (finitely many) variables which occur free in s. In particular, 
if y is a sentence, then s does not matter at all. 


‘Theorem 22A Assume that s4, s, are functions from V into |Ñ | which 
agree at all variables (if any) which occur free in the wff p. Then 


Ew p is] iff Fo g [sa]. 


Proof Recanse satisfaction was defined recursively, this proof uses 
induction. We consider the fixed structure Ñ and show by induction that 
every wif ọ has the property that whenever two functions s,, s, agree on the 
variables free in p, then Y satisfies p with s, iff it does so with s,. 


‘Case 1: p = Ph +t, is atomic. Then any variable in » occurs free. 
Thus s; and s, agree at all the variables in each ¢;. It follows that 5,(¢;) = 
5,(t;) for each i; a detailed proof would use induction on f. Consequently, 
YU satisfies Pt, --- 7, with s, iff it does so with s. 

Cases 2 and 3: has the form ~o or a -> p. These cases are immediate 
from the inductive hypothesis. 

Case 4: y = Vx y. Then the variables free in y are those free in y with 
the exception of x. Thus for any d in |% |, s,(x|d) and s.(x|d) agree at all 
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variables free in y. By inductive hypothesis, then, % satisfies y with s,(x|d) 
iff it does so with s,(x|d). From this and the definition of sanetecHon we 
see that Y satisfies Vx y with s, iff it does so with sz. A 


In effect, the above proof amounts to looking through the definition of 
satisfaction and seeing what information given by s was actually used. 
There is an analogous fact regarding structures: If Y and 8 agree at all the 
parameters which occur in p, then Fy ¢ [s] iff Eg g is]. 


Corollary 22B For a sentence o, either 


(a). U satisfies ø with every function s from V into |% |, or 
(b) Y does not satisfy o with any such function. 


If alternative (a) holds, then we say that ø is true in Ù (Hy o) or that Y 
is a model of ø. And if alternative (b) holds, then of course ø is false in Ù. 
(They cannot both hold since |% | is nonempty). Ù is a model of a set 2 
of sentences iff it is a model of every member of 2. 


Corollary 22C For a set X; t of sentences, X E+ 
& is also a model of t. 


iff every model of 


The definition of logical implication is very much like the definition of 
tautological implication in Chapter 1. But there is an important difference 
of cornplexity. Suppose in sentential logic you want to know whether or 
not a wff a is a tautology. The definition requires that you consider finitely 
many truth assignments, each of which is a finite function. For each such 
truth assignment v, you must calculate P(æ), which can be effectively done in 
a finite length of time. (Consequently, the set of tautologies is decidable, as 
was observed before.) 

In contrast to the finitary procedure for tautologies, suppose that you 
want to know whether or not a wff » (of our first-order language) is valid. 
The definition requires that you consider every structure YW. (In particular 
this requires using every nonempty set, of which there are a great many.) 
For each of these structures, you then must consider each function s from 
the set V of variables into | % |. And for each given % and s, you must de- 
termine whether or not Ñ satisfies p with s. When |Y | is infinite, this is a 
complicated notion in itself. 

In view of these complications, it is not surprising that the set of valid 


formulas fails to be decidable (cf. Section 3.5). What is surprising is that 


the notion of validity turns out to be equivalent to another notion (deduci- 
bility) whose definition is much closer to being finitary. (See Section 2.4.) 
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Using that equivalence we will be able to show (under some reasonable 
assumptions) that the set of validities (i.e., the set of valid wffs) is effectively 
enumerable. The effective enumeration procedure yields a more concrete 
characterization of the set of valid formulas. 

The notational conventions adopted earlier were done in a rational way: 


1. Hy (@ AB) [s] iff Hye [s] and Hy £ [s]; similarly for v and <>. 
2: ky Axo [s] iff there is some de |%| such that Fu @ [s(x]a)]. 


The proof for the second of these is as follows: 


Ey dxa@[s] iff ky mYxazis], 
iff it is not the case that for all d in |X |, ao Isla), 


iff for some d in |A|, Fae [s(x|d)]. 


Exampie Assume that our language has the parameters V, P (a two- 
place predicate symbol), f (a one-place function symbol), and c (a constant .Ț 
symbol), Let A be the structure for his language defined as follows: 


|W | = N,the set of all natural numbers, 
P% = the set of pairs <m, n> such that m<n, 
f* = the successor function S; fn =n + 1, 
“gil ʻ 0. 


We can summarize this in one line, by suppressing the fact that Ù is really 


< a function ‘and merély listing its components: 


W = (N, <, S, 0). 
This notation is unambiguous only when the context makes clear just which 
componenis go with which parameters. 

Let »:V—» N be the function for which s(v,) = i —1; 
S(v,.) = 1, ete. 


1. Sva) = SSQ) = 4. . 

2. (ffc) = 2; no use is made of s. . 

3. Hy Pcfv,'[s]. This is intuitively obvious, since when we translate back 
into English we get the true sentence “0 < 1.” More formally, the reason 
is that 


ie, s(v;) = 0, 


<5(e), Sfo) = <0, 1> e P”, 
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4. ky Vv, Pcv,. The translation into English is “0 is less than or equal to 
any natural number.” Formally we must verify that for any n in N, 


Hy Pev [s(v,|n)], 
which reduces to 
<0, n> e PY, 


5. Ey Vv, Poan, [s] because there is a natural number m such that 


Éu Pvt [slm]; 
Le., 


<s(va), m> ¢ PY, | 


In fact, since s(v,) = 1, we must take m to be 0. 
The reader is to be cautioned against confusing, for example, the function 
symbol f with the function f™. 


EXAMPLES of logical implication. The reader is invited to convince 
himself of each of the following: 


L Wo, Qn, E Ory. 

2. Qv, E Yri Ov. i 
3. Wi Ov, F Jv, Qo». : 
4, Ix Vy Pxy Yy Ax Pxy. : 
5, Wy dx Pxy $ Ix Vy Pxy. 

6. E Sx(Qx Wx Qx). 


EXAMPLE Suppose our given language has only the parameters V and P, 
where P is a two-place predicate symbol. Then a structure 9 is determined 
by the universe | Ù | and the binary relation P™. With some minor illegality 
we again write 


w = (|N |, P%. 


Now consider the problem of characterizing the class of all models of the 
following sentences: 


1. WxWy x y. A structure (A, R) is a model of this iff A contains exactly 
one element. R can either be empty or can be the singleton A X A. 

2. VxVy Pxy. A structure (A, R) is a model of this iff R= A x A. 
A can be any nonempty set. 

3. Vx Vy —Pxy. A structure (A, R) is a model of this iff R = ©. 

4, Wx dy Pxy. The condition for (A, R) to be a model of this is that 
dom R be A. 
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Definability of a class of structures 


For a set X of sentences, let Mod X be the class of all models of 2, i.e., 
the class of all structures for the language in which every member of 2 is 
true. For a single sentence t we write simply “Mod z” instead of “Mod {r}.” 
(The reader familiar with axiomatic set theory will notice that Mod 2, 
if nonempty, is a proper class; i.e., it is too large io be a set.) 


A class & of structures for our language is an elementary class (EC) iff. 


A = Mod +t for some sentence t. % is an elementary class in the wider 
sense (EC ,) iff X = Mod X for some set X of sentences. (The adjective 
“elementary” is synonymous with “‘first-order.’’) 


EXAMPLES 1. Assume that the language has equality and the parameters 
V and P, where P is a two-place predicate symbol. As before, a structure 
(A, R) for the language consists of a nonempty set A together with a binary 
relation R on A. (A, R) is an ordered set iff R is transitive and satisfies the 
trichotomy condition (which states that for any a and b in A, exactly one of 
<a, by € R, a = b, <b, a> € R holds). Because these conditions can be trans- 
lated into a sentence of the formal language, the class of nonempty ordered 
sets is an elementary class. It is, in fact, Mod r, where t is the conjunction 
of the three sentences 


Wx Vy Va(xPy —> yPz -= xPz); 
Vx Vy(xPy v x =æ y v pPx); 
Wx Vy(xPy => ayPx). 


The next two examples assume that the reader has had some contact 
with algebra. 

2. Assume that the language has az and the parameters V and o, where 
o is a two-place function symbol. The class of all groups is an elementary 
class, being the-class of all models of the conjunction of the group axioms: 


Vx Vy Wz x0(yoz) az (xo y)oz; 
YxVy Iz x oz = y; l 
WxVy Jz z ox =y. 

The class of all infinite groups is EC. To see this, let 


A, = dx dy x ae y, 
A, = dx Sy Sze yay zaxa az), 
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Thus A, translates, “There are at least n things.” Then the group axioms 
together with {A,, A,, ...} form a set Z for which Mod Z is exactly the class 
of infinite groups. We will eventually (in Section 2.6) be able to show that 
the class of infinite groups is not EC. 

3. Assume that the language has equality and the parameters V, 0, 1, +, 
e, Fields can be regarded as structures for this language. The class of all 
fields is an elementary class. The class of fields of characteristic zero is 
EC ,. It is not EC, a fact which will follow from the compactness theorem 
for first-order logic. (Section 2.6 again). 


Definability within a structure 


Consider a fixed structure Ñ. Suppose that gy is a forrnula such that 
all variables occurring free in p are included among v4, ..., V. Then for 
elements a,,..., ap of |A], 


Fup (a, ..-, ax) 


means that Y satisfies y with some (and hence with any) function s : V —> || 
for which sv) = a, 1<i<k. 
With cach such p and structure % we can associate the k-ary relation on 
LX], 
{<a,, aca! » Ay» : Fa p la, ai , apl- 


Call this the k-ary relation y defines in Ñ. In general, a k-ary relation on 
| A | is said to be definable in Ù iff there is a formula (whose free variables 
are among #,,..., V4) which defines it there. 


EXAMPLE Assume that we have a part of the language for number theory, 
specifically that our language has the parameters Y, 0, S, +, and -. Let 
N be the intended structure: 


| N| = N, the set of natural numbers. 
0? — 0, the number 0. 
S2, +", and -" are S, +, and -, the functions of successor, addition, 
, and multiplication. 
In one equation, , 
, N = (N, 0, S, +, +). 


‘ Some relations on N are definable in N and some are not. We know that | 


some are not definable because there are uncountably many relations on N 
but only $, possible defining formulas. (There is, however, an inherent 
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difficulty in giving a specific example. After all, if something is undefinable, 
then it is hard to say exactly what it is. Later we will get to see a specific 
example, the set of Gödel numbers of sentences true in N; see Section 3.5.) 


1. The ordering relation {<m, n> : m < n} is defined in N by the formula 
dvs vy + Sv, ~ vg. 


2. For any natural number n, {a} is definable. For example, {2} is de- 
fined by 
v = 880. 


Because of this we say that n is a definable element in N. 


3. The set of primes is definable in N. We could use the formula 
1 < vy AVV V0,(01 = 02° Va > Va = Í V vg = 1) 


if we had parameters 1 and < for | and <. But since {1} and < are de- 


finable in N, it is really quite unnecessary to add parameters for them; — 


we can just use their definitions instead. Thus the set of primes is definable by 
Jv, S0 +- Sv, = vy AWV V 05 (Di = Va * Vy => Va ~ SO V v3 = SO). 


4, Exponentiation, {<m, n, p> : p = m”} is also definable in N. This is 
much less obvious; we will give a proof later (in Section 3.7) using the 
Chinese remainder theorem. 


In fact, we will argue later that any decidable relation on N is definable 
in 9, as is any effectively enumerable relation and a great many others. To 
some extent the complexity of a definable relation can be measured by the 
complexity of the simplest defining formula, This idea will come up again 
at the end of Section 3.5. 


Momomorphisms 


Let %, B be structures for the language. A homomorphism h of Ù into 
¥ is a function A : |X | — |8| such that 


(a) For each n-place predicate symbol P and each n-tuple <a,, ..., An? 
of elements of |% |, 
Cis gee PR iE hay), -o Aan) E PP. 
(b) For each n-place function symbol f and each such n-tuple, 


Kf, sees a,,)) = f(a), eny hlan). 
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In the case of a constant symbol c this becomes 
A(e®) = c”, 


Conditions (a) and (b) are usually stated: “A preserves the relations and 
functions.” (It must be admitted that some authors use a weakened version 
of condition (a); our homomorphisms are their “strong homomorphisms. ”) 

If, in addition, / is one-to-one, it is then called an isomorphism of Ù into 
B, If there is an isomorphism of Ñ onto % (i.e., an isomorphism / for which 
ran A = | 8 |), then Y and 8 are said to be isomorphic ( = B). 

The reader has quite possibly encountered this notion before in special 
cases such as structures which are groups or fields. 


EXAMPLE Assume that we have a language with the parameters Y, +, 


and -. Let Ñ be the structure (N, +, -). We can define a function h: N . 


— {e, o} by 
fe if n is even, 
Mn) es \ o ifnis odd. 
Then / is a homomorphism of X onto B, where | 8 | = {e, o} and 4-9, -® 


are given by the following tables: 


+8 e o B e o 
e e o e e e 
o 0 e o e 0 


it can then be verified that condition (b) of the definition is satisfied. For 
example, if a and b are both odd, then A(a + b) =e and A(a) +” h(b) 
=0 +80 =e. 


EXAMPLE Let P be the set of positive integers, let <p be the usual 
ordering relation on P, and let <y be the usual ordering relation on N. Then 
there is an isomorphism / from the structure (P, <p) onto (N, <y); we 


take A(n) = n — 1. Also the identity map i: P— N is an isomorphism : : 


of (P, <p) into (N, <y). Because of this last fact, we say that (P, <p) 
is a substructure of (N, <y). 


More generally consider two structures A and B for the language such 
that [Y| = |B]. It is clear from the definition of homomorphism that 


the identity map from |Y | into | B| is an isomorphism of Ñ into % iff 
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_ (a) P% is the restriction of P® to |% |, for each predicate symbol P; 
(b) fY is the restriction of f® to |Y |, for each function symbol f, and 
cl = c® for each constant symbol c. 


If these conditions are met, then W is said to be a substructure of B, and 
% is an extension of Ù. 

These are basically algebraic notions, but the following theorem relates 
them to the logical notions of truth and satisfaction. 


Homomorphism Theorem Let h be a homomorphism of Ù into Y, and 
let s map the set of variables into |% |. 

(a) For any term ¢, h(S(t)) = hos(t), where 5(f) is computed in X and 
ho s(t) is computed in %8. 

(b) For any quantifier-free formula œ not containing the equality symbol, 


Ug 


gals] iff Ege [hos]. 


(c) If h is one-to-one (i.e., is an isomorphism of A into B), then in part 


(b) we may delete the restriction “not containing the equality symbol.” 
(d) If h is a homomorphism of Ù onto B, then in (b) we may delete the 
restriction “quantifier-free.” 


Proof Part (a) uses induction on £; see Exercise 13. 
(b) For an atomic formula such as Pt, we have 


Hy Pt [s] <> 5(¢) e PY 
<> h(5(t)) E P® 
hos(t)e P® by (a) 
<> ky Pt [hos]. 


since A is a homomorphism 


An inductive argument is then required to handle the connective symbols 


= and —», but it is completely routine. 
(c) In any case, 
Ey u = t [s] > 5(u) = 5) 
. => h(5(u)) = AGO) 
< hos) =hos(t) by (a) 


<> Hp u =~ t [hos], 


_If h is one-to-one, the arrow in the second step can be reversed. 
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(d) We must extend the routine inductive argument of part (b) to include 
the quantifier step. That is, we must show that if@ has the property that for 
every s, 


Pap Gaeta: 


then Vx y enjoys the same property. We have in any case (as a consequence 
of the inductive hypothesis on g) the implication 


. FaVx g [hos] => Ey Wx ¢ [s]. 


This is intuitively very plausible; if y is true of everything in the larger set 
| B|, then a fortiori it is true of everything in the smaller set |N |. The 
details are, for an element a of | % |, 


Fa Yx p [hos] = Fa ph os)(x|A(@))] 


B <> Eg y [h o0 (s(x{a))], 
<> Fa @ [s(x]a)] 


the functions being the same 
by the inductive hypothesis. 


For the converse, suppose that fyVx p [hos], so that Ky [(h o s)(x|b)] 
for some element b in | 8 |. We need the implication 


(x) If for some b in | Bl, Ey =p [(h o s)(x|b)], then for some a 
in [UA], Eomp [ro Axala. 
For given (*), we can proceed: 
En “ip [(A o s)(x|h(a))] <> Ey me [h o (s(xla))], the functions being the same 


<> Fa ap [s(x|a)] 
=> Fy Vx o [s]. 


by the inductive hypothesis 


If h maps | Ù | onto | ® |, then (*) is immediate; we take a such that b = h(a). 
(But there might be other fortunate times when (+) can be asserted even if 
h fails to have range | 8|.) 


Two structures Ù and B for the language are elementarily equivalent 
(X = 8) iff for any sentence o, 


Fy o <> Fy a. 


As a corollary to the above theorem we have: Isomorphic structures are 
elementarily equivalent. Actually more is true. Isomorphic structures are 
alike in every “structural” way; not only do they satisfy the same first- 
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order sentences, they also satisfy the same second-order (and higher) sen- 
tences (i.e., they are secondarily equivalent and more). 

There are elementarily equivalent structures which are not isomorphic, 
For example, it can be shown that the structure (R, <p) consisting of 
the set of real numbers with its usual ordering relation is elementarily 
equivalent to the structure (Q, <q) consisting of the set of rational numbers 
with its ordering (see Section 2.6). But Q is a countable set whereas R is 
not, so these structures cannot be isomorphic. In Section 2.6 we will see 
how easy it is to make elementarily equivalents structures of differing 
cardinalities. 


EXAMPLE, revisited We had an isomorphism h from (P, <p) onto 
(N, <y). So in particular, (P, <p) = (N, <y); these structures are in- 
distinguishable by first-order sentences. 

We furthermore noted that the identity map was an isomorphism of 
(P, <p) into (N, <y). Hence for a function s : V —> P and a quantifiers 
free p, 


Fip,<p P [8] <> Fw, <m p [s]. 
This equivalence may fail if pọ contains quantifiers. For example, 


Ep, <p) VV FY va —> vy < v2) [L], 
but 
E, <yy Vvo FY va > v, <w) [1]. 


An automorphism of the structure Ù is an isomorphism of % onto X. 
The identity function on |A | is trivially an automorphism of A. Ù may 
or may not have nontrivial automorphisms. As a consequence of the homo- 
morphism theorem, we can show that an automorphism must preserve the 
definable relations: 


Corollary 22D Let A be an automorphism of the structure W, and 
let R be an n-ary relation on |Ù | definable in A. Then for any a, <., ap 
in | |, 

(Oy, +145 A> E R & Cha), ..., h(a, E R. 


‘Proof Let p be the formula which defines R in YW. We need to know 
that 


ku p lars -> Gl Fu p TAa, ..., ACa,) I. 


But this is immediate from the homomorphism theorem. 5 


| 
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This corollary is sometimes useful in showing that a given relation is 
not definable. Consider, for example, the structure (R, <} consisting of 
the set of real numbers with its usual ordering. An automorphism of this 
structure is simply a function 4 from R onto R which is strictly increasing: 


a <b <> h(a) < hb). 


One such automorphism is the function A for which h(a) = a. Since this 
function maps points outside of N into N, the set N is not definable in 
this structure. 

Another example is provided by elementary algebra books, which some- 


times explain that the length of a vector in the plane cannot be defined 


in terms of vector addition and scalar multiplication. For the map which 


takes a vector x into the vector 2x is an automorphism of the plane with 


respect to vector addition and scalar multiplication, but it is not length- 
preserving. From our viewpoint, the structure in question, 


(E, +s A rer» 


has for its universe the plane Æ, has the binary function -+ of vector ad- 
dition, and has (for each r in the set R) the unary function f, of scalar 
multiplication by r. (Thus the language in question has a one-place function 
symbol for cach real number.) The doubling map described above is an 
automorphism of this structure. But it does not preserve the set of unit 
vectors, 

{x: xe E and x has length 1}. 


So this set cannot be definable in the structure. (Incidentally, the homomor- 
phisms of vector spaces are normally called /inear transformations.) 


EXERCISES 


1. Show that (a) ';0 | p iff I f (œ > ¢); and (b) p F 4y iff Hipy) 


2. Show that no one of the following sentences is logically implied by 


the other two. (This is done by giving a structure in which the sentence in 


question is false, while the other two are true.) 


(a) Vx Wy Vz(Pxy — Pyz —> Pxz), Recall that by our convention a —> f 
=> y is g => (P > y). 

(b) Yx Vyp(Pxy — Pyx — x = y). 

(c) Vx Sy Pxy — dy Vx Pxy. 
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3. Show that 
{Yx — p), Vx a} E Vx e. 


4. Show that if x does not occur free in a, then a H Yxa. 


5. Show that the formula x ~ y — Pzfx = Pzfy (where f is a one-place 
function symbol and P is a two-place predicate symbol) is valid. 


. 6. Show that a formula 6 is valid iff Vx 0 is valid. 


7. Restate the definition of “Ù satisfies p with s” in the way described 
on pages 82f. That is, define by recursion a function h such that Y satisfies 
p with s iff s € Ap). 

8. Assume that X is a set of sentences such that for any sentence t, 
either X Ex or X H =r, Assume that Ù is a model of X. Show that for any 
sentence T, Fy 7 iff 2’ Ez. l 


9. Assume that the language has equality and a two-place predicate 


symbol P. For each of the following conditions, find a sentence o such that 


the structure N (= (|Ù |, P%) is a model of o iff the condition is met. 


(a) |A| has exactly two members. 

(b) PY is a function from | | into |% |. 

(c) PY is a permutation of |Ù |; i.e, P% is a one-to-one function with 
domain and range equal to |% |. 


10. Show that 
Fa Vo, Qov [c*] iff Hy Ve, Qen. 


Here Q is a two-place predicate symbol and c is a constant symbol. 


ll. For each of the following relations, give a formula which defines 
it in (N, +, +). (The language is assumed to have equality and the pa- 
rameters V, +, and -.) 


(a) {0}. 

(b) {1}. 

-(c) {<m, ny :n is the successor of m in N}. 

(d) {<m,n>:m <n in N} 

12. Let R be the structure (R, +, +). (The language is assumed to have 
equality and the parameters V, +, and +. N is the structure whose universe 
is the set R of real numbers and such that -+-" and -® are the usual addition 
and multiplication operations.) 
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(a) Give a formula which defines in 9 the set [0, oo). 

(b) Give a formula which defines in 8 the set {2}. 

*(c) Show that any finite union of intervals, the endpoints of which 
are algebraic, is definable in R. (The converse is also true; these are the 
only definable sets in the structure. But we will not prove this fact.) 


13. Prove part (a) of the homomorphism theorem. 


14, What subsets of- the real line R are definable in (R, <)? What 
subsets of the plane R x R are definable in (R, <)? 


15. Show that the addition relation, {<m, n, pò : p = m + n}, is not 
definable in (N, +). Suggestion: Consider an automorphism of (N, +) 
which switches two primes. 


16. Let Ù be a structure; let B be a set which includes | % |. Show that 
there is a structure 8 whose universe is B and such that for any sentence o 
not containing =~, Fy o iff Hyo. Suggestion: Choose some a, € ||. 
Let h : B — | Ù | map points in | Y | into themselves and map other points 
of B into a. Define 8 so that h is a homomorphism of 8 onto XW. 


17. (a) Consider a language with equality whose only parameter (aside 
from VY) is a two-place predicate symbol P. Show that if W is finite and 
YW = B, then W is isomorphic to B. 

*(b) Show that the result of part (a) holds regardless of what parameters 
the language contains. 


18. A universal (V,) formula is one of the form Yx +: Wx, 0, where 
6 is quantifier-free. An existential (3,) formula is of the dual form 4x, +++ Ix, 
0. Let A be a substructure of B, and let s : V — |W]. 


(a) Show that if Fy p [s] and @ is universal, then Fy p [s]. And if Ey y Is] 
and œ is existential, then Fg y [s]. 

(b) Conclude that the sentence Vx Px is not logically equivalent to any 
existential sentence, nor Jx Px to any universal sentence. 


19. An A, formula is one of the form Ax, -+-+ dx, 0, where 0 is universal. 


(a) Show that if an 4, sentence not containing function symbols is true 
in Y, then it is true in some finite substructure of Y. 

(b) Conclude that Vx Ay Pxy is not logically equivalent to any 3, sen- 
tence. 


20. Assume the language has equality aud a two-place predicate symbol 
P. Consider the two structures (N, <) and (R, <) for the language. 
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(a) Find a sentence true in one structure and false in the other. 
*(b) Show that any 4, sentence (as defined in the preceding exercise) 
true in (R, <) is also true in (Ñ, <). 


31. We could consider enriching the language by the addition of a new 
quantifier. The formula d!x a (read: there exists a unique x such that ø) 
is to be satisfied in A by s iff there is one and only one a € | | such that 
ky a [s(x|@)]. Assume that the language has the equality symbol and show 


that this apparent enrichment comes to naught, in the sense that we can — 


find an ordinary formula a’ logically equivalent to 3!x a. 


22, Let Ý be a structure and g a one-to-one function with dom g = | % |. 
Show that there is a unique structure 8 such that g is an isomorphism of 


Y onto %. 


23. Let A be an isomorphism of Ù into B. Show that there is a structure 
€ isomorphic to B such that U is a substructure of ©. Suggestion: Let g 
be a one-to-one function with domain | 8| such that g(h(a)) = afora e |X|. 
Form © such that g is an isomorphism of 8 onto @. 


24, Consider a fixed structure YX. Expand the language by adding a new 
constant symbol c, for each a € |A |. Let Ñ* be the structure for this ex- 
panded language which agrees with Ñ on the original parameters and which 


assigns to cy the point a. A relation R on |Ù | is said to be definable from 


points in Ñ iff R is definable in Y+. (This differs from ordinary definability 
only in that we now have parameters in the language for members of | % |.) 
Let R= (R, <, +, +). . 

(a) Show that if A is a subset of R consisting of the union of finitely 
Many intervals, then A is definable from points in R. 

(b) Assume that X =N. Show that any subset of |Y | which is non- 
empty, bounded (in the ordering <™), and definable from points in Ù has 
a least upper bound in |Ù |. 


§ 2.3 UNIQUE READABILITY! 


As in sentential logic, we need unique readability results in order to be 
able to apply the recursion theorem. If we had used Polish notation both 
for terms and for formulas, then we could give one proof that. would yield 


1 This section may be omitted by a reader willing to accept the meaningfulness of our 
Many definitions by recursion. 
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simultaneously the facts that the terms are freely generated and that th 
formulas are freely generated. But, in fact, we have used a mixed notation, 


Consequently, we will give first a unique readability proof for terms; | 


actually that proof will be applicable to any use of Polish notation. And | 


then we will give an extension of that proof to cover the formulas. 
Recall that the set of terms is generated from the variables and constant 
symbols by operations corresponding to the function symbols. We now 
define a function K on the symbols involved such that for a symbol s, 
K(s) = 1 — n, where n is the number of terms which must follow s to obtain 
a term. 
K(x) =1-0=1 
Ke) =1-0=1 
KS) =1—- 4 


for a variable x; 
for a constant symbol c; 
for an n-place function symbol f. 


We then extend K to the set of expressions using these symbols by defining . 


K(5182 ey Sp) = K(s) + K(S9) i K(s,). 
Since no symbol is a finite sequence of others, this definition is unambiguous. 
Lemma 23A For any term ¢, K(f) = 1. 


Proof Use induction on £. The inductive step, for an n-place function 
symbol f, is 


Kft, +i, =A—n ++ ee tHDH=H 1. 


In fact, K was chosen to be the unique function on these symbols for 
which Lemma 23A holds. It follows from this lemma that if ¢ is a concat- 
enation of m terms, then K(e) = m. 

By a terminal segment ‘of a sequence <s,,...,8,> of symbols we mean 
a sequence of the form Cs, Spyro oeeo Sps Where LS k <a, , 


Lemma 23B Any terminal segment of a term is a concatenation of 
one or more terms.. 


Proof We use induction on the term. For a one-symbol term (i.e, a 
variable or a constant symbol) the conclusion follows trivially. For a term 


Jt; +++ t,, any terminal segment (other than the term itself) must equal 


1 
tile te bns 


where k <n and t; is a terminal segment of t}. By the inductive hypothesi 
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i, is a concatenation of, say, m terms, where m > 1. So altogether we have 
m+ (n — k) terms. 


Corollary 23C No proper initial segment of a term is itself a term. 


Proof Suppose a term ¢ is divided into a proper initial segment f, and 
a terminal segment f. Then | = K(t) = K(t,) + K(f), and by Lemma 
23B, K(t,) > 1. Hence K(t,) <1 and f cannot be a term. 


Unique Readability Theorem for Terms The set of terms is freely gen- 
erated from the set of variables and constant symbols by the @ operations. 


Proof First, it is clear that if fA g, then ran Z is disjoint from ran F 
this requires checking only the first symbol. Furthermore, both ranges are 
disjoint from the set of variables and constant symbols, It remains only 
to show that @;, when restricted to terms, is one-to-one. Suppose, for a 


two-place jf, we have 


Site = fists. 
By deleting the first symbol we are left with 
lilo = Eat. 


If 4 Æ t, then one would be a proper initial segment of the other, which 
is impossible for terms by the above corollary. So 4; = f,, and we are then 


left with f, = ty. a 


To extend this argument to formulas, we now define K on the other 


symbols: 
KQ=-h 
K()) = 1; 
KW) = —1; 
K(>) = 0; 
Ke) = —l; 


K(P) = 1 — n for an n-place predicate symbol P. 


along with s. We extend K as usual to the set of all expressions: 


K(s, ae Sp) = K(sy) iets Ste K(s,). 


The idea behind the definition is again that K(s) should be 1 — n, where 
nis number of things (right parentheses, terms, or formulas) required to go 


| 
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Lemma 23D For any wf œ, K(æ) = 1. 


Proof Another straightforward induction. 
Lemma 23E For any proper initial segment a’ of a wff a, K(a’) < 1. 
Proof Use induction on œ. The details are left to Exercise 1. a 


Corollary 23F No proper initial segment of a formula is itself a 
formula. 


Unique Readability Theorem for Formulas The set of wffs is freely 
generated from the set of atomic formulas by the operations &,, &,, 
and 2i = 1, 2; h .). 

Proof The unary operations 22, and 2; are obviously one-to-one. 
As in Section 1.4, we can show that the restriction of &, to wffs is one- 
to-one. 

The disjointness half of the theorem follows from the ad hoc observa- 
tions: . 

l. ran &,, ran 9;, ran d j, and the set of atomic formulas are pairwise 
disjoint, for i j. (Just look at the first two symbols.) 

2, ran &,, ran 2;, ran Q;, and the set of atomic formulas are similarly 
pairwise disjoint, for i>4 j. 

3. For a wif B, (aa) 4 (8 > y), because no wff begins with =. Hence 
ran &, is disjoint from the range of the restriction of &, to wits. iM 


As in Section 1.4, we can obtain an effective algorithm for finding the 
unique tree displaying the pedigree of a term or formula. There we counted 
parentheses in initial segments; now we use the function K in the analogous 
role. 


EXERCISES 


1. Show that for a proper initial segment’ of a wff æ, we have K(a') < 1. 


2. Let s be an expression consisting of variables, constant symbols, and 
function symbols. Show that e is a term iff K(e) = 1 and for every terminal 
segment e’ of e we have K(«') > 0. Suggestion: Prove the stronger result 
that if K(e’) > 0 for every terminal segment ¢' of e, then e is a concatenation 
of K(e) terms. 
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§2.4 A DEDUCTIVE CALCULUS 


Suppose that X | r. What methods of proof might be required to demon- 
strate that fact? Is there necessarily a proof at all? 

Such questions lead immediately to considerations of what constitutes 
a proof. A proof is an argument you give to someone else and which com- 
pletely convinces him of the correctness of your assertion (in this case, 
that Z Er). i 

Thus a proof should be finitely long, as you cannot give all of an infinite 
object to another person. If the set 2/ of hypotheses is infinite, they cannot 
all be used. But the compactness theorem for first-order logic (which we 
will prove in Section 2.5 using the deductive’ calculus of this section) will 
ensure the existence of a finite X, S X such that Jọ E t. 

Another essential feature. of a proof (besides its being finite in length) 


‘is that it must-be possible for someone else (if he is to be convinced by it) 


to check the proof to ascertain that it contains no fallacies. This check must 


be effective; it must be the sort of thing that can be carried out without - 


brilliant flashes of insight on the part of the checker. In particular, the set 
of proofs from the empty set of hypotheses (i.e., proofs that H t) must be 
decidable. This implies that the set of sentences provable without hypotheses 
must be effectively enumerable. For one could in principle enumerate prov- 


- able sentences by generating all strings of symbols and sorting out the 


proofs from the nonproofs. When a proof is discovered, its last line is entered 
on the output list. (This issue will be examined more carefully at the end 


_ of Section 2.5.) But here again there is a theorem (the enumerability theorem, 


proved in Section 2.5) stating that, under reasonable conditions, the set of 
valid senteiices is indeed effectively enumerable, 

Thus the compactness theorem and the enumerability theorem are neces- 
sary conditions,for proofs of logical implication always to exist. Conversely, 
we claim that these two theorems are sufficient for proofs to exist. For 
suppose that 2/ F v. By the compactness theorem, then, there is a finite 


Set {o, ..., Cp} S 4 which logically implies +. Then 09> +: => 0,1 


is valid (Exercise 1, Section 2.2). So to demonstrate conclusively that X } t 
one need only carry out a finite number of steps in the enumeration of the 
validitiés until Og — +++ —> O,->T appears, and then verify that each o; € X. 
(This should be compared with the complex procedure suggested by the orig- 
inal definition of logical implication, discussed on page 84.) The record of 
the enumeration procedure which produced og —> +++ -=> 0,7 can then be 
regarded as a proof that Xt. As a proof, it should be acceptable to anyone 
who accepts the correctness of your procedure for enumerating validities. 


102 2. FIRST-ORDER LOGIC 


Against the foregoing general (and slightly vague) discussion, the outline 
of this section can be described as follows: We will introduce formal proofs 
but we will call them deductions, to avoid confusion with English language 
proofs. These will mirror (in our model of deductive thought) the proofs 
made by the working mathematician to convince his colleagues of certain 
truths. Then in Section 2.5 we will show that whenever X } r, there is a | 
deduction of t from 5 (and only then). This will, as is suggested by the 
foregoing discussion, yield proofs of the compactness theorem and the 
enumerability theorem. And in the process we will get to see what methods 
of deduction are adequate to demonstrate that a given sentence, is, in fact, 


logically implied by certain others. 


Formal deductions 


We will shortly select an infinite set A of formulas to be called logical 
axioms. And we will have a rule of inference, which will enable us to obtain a 
new formula from certain others. Then for a set I of formulas, the theorems | 
of I’ will be the formulas which can be obtained from T U A by use of 
the rule of inference (some finite number of times). If p is a theorem of r 
(written Tj- g), then a sequence of formulas which records (as explained | 
below) how p was obtained from T'U A with the rule of inference will | 
be called a deduction of p from I. : 

The choice of A and the choice of the rule (or rules) of inference are far | 
from unique. In this section we are presenting one deductive calculus for | 
first-order logic, chosen from the array of possible calculi. (For example, 
one can have A = Ø by using many rules of inference. We will take the op- | 
posite extreme; our set A will be infinite but we will have only one rule of | 
inference.) 

Our one rule of inference is traditionally known as modus ponens. It is 
usually stated: From the formulas œ and œ — B we may infer P: 


a, œ => f 


B 


Thus the theorems of a set J" are the formulas obtainable from /’U A 
by use of modus ponens some finite number of times. 

We can also look at modus ponens in the following way. Call a set A 
of formulas closed under modus ponens iff whenever two formulas œ and 
œ — B are in A, then also £ is in 4. For a fixed set I’, say that A is inductive 
iff u A c A and A is closed under modus ponens. Then the set of theo- 
rems of T is simply the smallest inductive set, This differs from the situatio 
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of Section 1.2 only in that instead of closing the initial set under (everywhere- 
defined) functions, we are closing it under a relation which is only a ‘‘par- 
tially-defined”’ function. (Its domain consists not of all pairs of formulas, 
but only of pairs, of the form <a,a-> B>.) The ideas of Section 1.2 are, 
however, applicable to the present situation. Thus we may define to be a 
theorem of I’ (written I | p) iff y belongs to the set generated from P U A 
by modus ponens. (But the reader should be warned that the set of theorems 
of F is not freely generated. This reflects the fact that a theorem never has 
a unique deduction.) AE 

For example, if the formulas f, y, and y —> p ->œ are all in PUA, 
then Tj- æ, as is evidenced by the tree 


which displays how « was obtained. Although it is tempting (and in some | 


ways more elegant) to define a deduction to be such a tree, it will be simpler 
to take deductions to be the linear sequences obtained by squashing such 
trees into straight lines. 


Definition A deduction of » from T is a sequence (ay, ...,@,> of for- 


mulas such that a, = p and for each i < n either 

(a) œ; is in P U A, or 

(b) for some j and k less than i, œ; is obtained by modus ponens from 
a; and a, (i.€., Qp = Oj; => @). 

In Section 1.2 sequences of the sort just described (i.c., deductions) 
were called construction sequences. We showed there that C,, (the set of 
things having construction sequences) coincides with C* (the intersection 
of all inductive sets). We now repeat that result, in the present case of gen- 
eration from P'U A by modus ponens: 


Theorem 24A ‘There exists a deduction of a from J’ iff œ is a theorem 
of r, 


Proof If there is a deduction <a, ...,@,>, then (by induction on i) 
each œ; belongs to the set generated from I” U A by modus ponens. Con- 
versely, the set of formulas for which deductions exist includes I'U A 
(whose members have one-line deductions) and is closed under modus 


Ponens. (Why?) Consequently, this set includes all theorems of J’. 


104 2. FIRST-ORDER LOGIC 


This justifies defining p to be deducible from D iff | ¢. 

Now at last we give the set A of logical axioms. These are arranged in 
six groups. Say that a wif y is a generalization of y iff for some n > 0 and 
some variables x1, ..., Xp, 


p = Wx Yxa Y. 


We include the case n = 0; any wif is a generalization of itself. The logical 
axioms are then all generalizations of wffs of the following forms, where 
x and y are variables and œ and f are wffs: 


1. Tautologies; 

2. Wxa—>a%, where ¢ is substitutable for x in a; 
3. Wx(a—> B) — (Yxa — Vx f); 

4. œ= Wxa, where x does not occur free in a. 


And if the language includes equality, then we add 


5. x= X; 
6. x = y = (æ —a'), where œ is atomic and a’ is obtained from a by 
replacing x in zero or more (but not necessarily all) places by y. 


For the most part groups 3-6 are self-explanatory; we will see various 
examples later. Groups 1 and 2 require explanation. But first we should 
admit that the above list of logical axioms may not appear very natural. 
Later it will be possible to see where each of the six groups originated. 


Substitution 


In axiom group 2 we find 


Vxa->of. 


Here o% is the expression obtained from the formula æ by replacing the 
variable x, wherever it occurs free in a, by the term ¢. This notion can also 
be (and for us officially is) defined by recursion: 


1. For atomic a, o% is the expression obtained from œ by replacing the 
variable x by £. (This is elaborated upon in Exercise 1. Note that af is itself 
a formula.) 

2. (ma)? = (07). 

3. (œ — BY = (œr > pi). 


Vya if x=y 
oo > 
none (ven if xy. 
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EXAMPLES 


l. Pn = P. E 
2. (Qx —> Wx Px) = (Qy > Vx Px). 
3, If a is =Wy x = y, then Vxa—-az is 


Vx =y x = po NyZ = y. 
4, For a as in 3, Yxa = æy is 
Vx Ny x = y => myy = y. 


The last example above illustrates a hazard which must be guarded 
against. On the whole, Vx a> af seems like a plausible enough axiom. 
(“if æ is true of everything, then it should be true of ¢.’”) But in example 4, 
we have a sentence of the form Vxa—>of, which is nearly always false. 
The antecedent, Vx ~Wy x ~ y, is true in any structure whose universe 
contains two or more elements. But the succedent, ~Vy y =~ y, is false 
in any structure. So something has gone wrong. 

The problem is that when y was substituted for x, it: was immediately 
“captured” by the Vy quantifier. We must impose a restriction on axiom 
group 2 that will preclude this sort of quantifier capture. Informally, we 
can say that a term £ is not substitutable for x in @ if there is some variable 
y in ¢ that is captured by a Vy quantifier in ay. The real definition is given 
below by recursion. (Since the notion will be used later in proofs by in- 
duction, a recursive definition is actually the most usable variety.) 

Let x be a variable, ¢ a term. We define the phrase “‘t is substitutable for 
x in œ” as follows: 


1. For atomic a, t is always substitutable for x in æ. (There are no quanti- 
fiers in œ, so no capture could occur.) 

2. tis substitutable for x in (~¢) iff it is substitutable for x in æ. £ is 
substitutable for x in (æ — £) iff it is substitutable for x in both æ and £, 

3, ¢ is substitutable for x in Vy a iff either 


(a) x does not occur free in Vya@, or 
(b) y does not occur in ¢ and t is substitutable for x in æ. 
(The point here is to be sure that nothing in ¢ will be captured by the Vy 
prefix and that nothing has gone wrong inside @ earlier.) 


For example, x is always substitutable for itself in any formula. If ¢ 
contains no variables which occur in œ, then f is substitutable for x 
in a, 
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The reader is cautioned not to be confused about the choice of words, 
Even if ¢ is not substitutable for x in a, still af is obtained from æ by replac- 
ing x wherever it occurs free by ¢. Thus in forming œf, we carry out the 
indicated substitution even if a prudent person would think it unwise to 
do so. l 

Axiom group 2 consists of all generalizations of formulas of the form 


Vxa—> of, 


example, 


| 
where the term ¢ is substitutable for the variable x in the formula a. For 
Wv3(Wv,(Av, -> Vv, Ada) => (Av, => Vv, Av:)) 


is in axiom group 2. Here x is v, œ is Av, Vv; Av,, and t is v. On the 
other hand, 
Vv, Vv, Boy, — Wv Bv 


‘is not in axiom group 2, since v, is not substitutable for v, in Vv, Byw. 


RANE GNSS 


Tautologies 


Axiom group 1 consists of generalizations of formulas to be called tau- 
tologies. These are the wffs obtainable from tautologies of sentential logic 
(having only the connectives ~ and —) by replacing each sentence symbol 
by a wff of the first-order language. For example, 


renee 


Vx[(Wy Py > Px) => (Px > “Vy Py) 


belongs to axiom group 1. It is a generalization of the formula in square __ 
- brackets, which is obtained from a contraposition tautology 


(A — ~B) => (B — A) 


by replacing A by Vy ~Py and B by Px. 
There is another, more direct, way of looking at axiom group 1. Divide 
the wffs into two groups: i 


1. The prime formulas are the atomic formulas and those of the form 
Yxa. ' 

2. The nonprime formulas are the others, i.e., those of the form —a 
or œ = p. 
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Thus any formula is built up from prime formulas by the operations 
Z and &,. Now go back to sentential logic, but take the sentence symbols 
to be the prime formulas of our first-order language. Then any tautology 
of sentential logic (which uses only the connectives —, —>) is in axiom group 
1. There is no need to replace sentence symbols here by first-order wits ; 
they already are first-order wffs. Conversely, anything in axiom group | 
is a generalization of a tautology of sentential logic. (The proof of this 
uses Exercise 7 of Section 1.3.) 


EXAMPLE, revisited 
(Wy Py => —Px) > (Px > =y —Py). 


This has two sentence symbols (prime formulas), Vy —~Py and Px. So its 
truth table has four lines: 


(Vy Py + Px) > (Px => “Wy Py) 


T FFT T T FFT 
T TTF T F TFT 
F TFT T TTTEF 
F TTF T F TTF. 


From the table we see that it is indeed a tautology. 
On the other hand, neither Wx(Px > Px) nor Wx Px— Px is a tau- 
tology. . 


One remark: We have not assumed that our first-order language has 
only countably many formulas. So we are potentially employing an exten- 
sion of Chapter | to the case of an uncountable set of sentence symbols, 

A second remark: Now that first-order formulas are also wffs of sentential 
logic, we can apply notion from both Chapters 1 and 2 to them. If I" tau- 
tologically irnplies p, then it follows that I also logically impies p. (See 
Exercise 3.) But the converse fails. For example, Vx Px logically implies 
Pe. But Wx Px does not tautologically imply Pc, as Vx Px and Pc are two 
different sentence symbols, 


Theorem 24B Ijo iff PUA tautologically implies p. 


_ Proof (=>): This depends on the obvious fact that {a, œ > p} tautolog- 
‘cally implies B. Suppose that we have a truth assignment v which satisfies 
“very member of I” U A. By induction we can see that v satisfies any theo- 


tem of I. The inductive step uses exactly the abovementioned obviousfact. 
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(<=): Assume that T'U A tautologically implies p. Then by the corollary 
to the compactness theorem (for sentential logic), there is a finite subset 
Vig hile Pmr ArI wss An} which tautologically implies g. Consequently, 


nA Yn Ao oe +A, > 


is a tautology (cf. Exercise 3 of Section 1.3) and hence is in A. By apply- 
ing modus ponens m + n times to this tautology and to {y1; ->s Yms 
Ais «+ +> An}, We obtain o. a 


(The above proof uses sentential compactness for a possibly uncount- 


able language.) 


Deductions and metatheorems 


We now have completed the description of the set A of logical axioms. 
The set of theorems of a set I is the set generated from F U A by modus 


ponens. For example, 


l 
| 
: 


H Px — 3y Py. 


(Here I’ = Ø; we write “| a” in place of “Q L @.”) The formula Px —> Ay Py 
can be obtained by applying modus ponens (once) to two members of A 
as displayed by the pedigree tree: 


Px => ~Wy-Py | 
Vy Py — Px ' 
(in axiom group 2) 


(Wy “Py — Px) > (Px -> “Vy>Py) 
(in axiom group 1) 


We get a deduction of Px — Ay Py (from ) by compressing this tree int 
a linear three-element sequence. 
As a second example, we can obtain a generalization of the formula i 


the first example: - 


H Wx(Px — 3y Py). 


This fact is evidenced by the following tree, which displays the constructio! 
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of Wx(Px -> dy Py) from A by modus ponens: 


Wx(Px = “~Vy—Py) 


Vx(Vy “Py = Px) 
(in axiom group 2) 


Vx(Vy Py = Px) > Vx(Px > mV y-Py) 
(a member of axiom group 3) 


i 
Wx[(Vy Py > Px) > (Px => ~Vy—Py)] 
(in axiom group 1) 


Again we can compress the tree into a deduction. 

In these examples the pedigree trees may seem to have been pulled out 
of the air. But we will shortly develop techniques for generating such trees 
in a somewhat systematic manner. These techniques will rely heavily on 
the generalization theorem and the deduction theorem below. 

Notice that we use the word “theorem” on two different levels. We say 
that œ is a theorem of I’ if I’. a. We also make numerous statements in 
English, each called a theorem, such as the one below. It seems unlikely 
that any confusion will arise. The English statements could have been labeled 
metatheorems to emphasize that they are results about deductions and 
theorems. 

The generalization theorem reflects our informal feeling that if we can 
prove x without any special assumptions about x, we then are entitled 
to say that “since x was arbitrary, we have Vx__x__.” 


Generalization Theorem If I’; y and x does not occur free in any 
formula in I’, then P H Yx g. 


Proof Consider a fixed set I and a variable x not free in I. We will 
show by induction that for any theorem g of I’, we have I} Vx. For 
this it suffices (by the induction principle) to show that the set 


p: T LVx9} 


includes T'U A and is closed under modus ponens. Notice that x can 
occur free in o. i i 


110 2. FIRST-ORDER LOGIC 


Case 1: y is a logical axiom. Then Wx @ is also a logical axiom. And s 
PELWx oy. 


Case 2: p e I’. Then x does not occur free in g. Hence 


p>Vxp 
is in axiom group 4. Consequently, IE Vx, as is evidenced by | 
Vx | 

DT 


(in I’) (in axiom group 4) 


Case 3: p is obtained by modus ponens from y and p-» gy. Then by 
inductive hypothesis we have 7} Vx y and I j- Yxly >g). This is just 
the situation in which axiom group 3 is useful. That l H Vx@ is evi- 
denced by 


Vx yp 


J\ | 


Vey Yxyp Wx 


SS 
Vx(y > p) Vx(p -> p) > (Wx y > Yx p) 
(in axiom group 3) 


So by induction I b Vx g for every theorem g of I. a 


(The sole reasons for having axiom groups 3 and 4 are indicated by the 
above proof.) 

The restriction that x not occur free in J” is essential. For example, 
Px £ Vx Px, and so bythe soundness theorem of Section 2.5, Px / Vx Px: 
On the other hand, x will in general occur free in the formula p. For exam — 
ple, at the beginning of this subsection we showed first that | 


H (Px — dy Py). 
The second example there, 


H Wx(Px —» 3y Py), 


was obtained from the first example as in case 3 of the above proof. 
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-Exampte Wx Vya + Vy Yxa. 


The proof of the generalization theorem actually yields somewhat more 
than was‘ stated. It shows how we. can, given a deduction of g from I’, 
effectively transform it to obtain a deduction of Vx from r. 


Lemma 24C (rule T) If Pb o,,..., 0; a, and {oy,..., a} tau- 
tologically implies B, then T H- 2. 


Proof 0 -> +++ +O, => B is a tautology, and hence a logical axiom. 
Apply modus ponens n times. a 


Deduction Theorem If J’; y | @, then IH (> gp). 


(The converse clearly holds also; in fact, the converse is essentially the 
rule modus ponens.) 


First proof 


Tr;yho i 3 y) UA tautologically implies p, 


iff PU A tautologically implies (y > p), 
if TH (0p) m 


Second proof The second proof does not use the compactness theorem 
of sentential logic as does the first proof. It shows in a direct way how to 
transform a deduction of p from T ;y to obtain a deduction of (y >p) 
from I’. We show by induction that for every theorem g of I”; y the formula 
(y > p) is a theorem of I’. 

Case 1: p = y. Then obviously | (y =p) > 

Case 2: y is a logical axiom or a member of T’. Then T |- p. And 9 tau- 
tologically implies (y —> p), whence by rule T we have IT p (y= gp). 

. Case 3: y is obtained by modus ponens from y and y —> ¢. By the inductive 
hypothesis, I |- (y =y) and I p (y —> (p —p)) And the set {y —> y, 
y.—> (p — p)} tautologically implies y — p. Thus, by rule T, + (y+ 7). 
So by induction the conclusion holds for any p deducible from °; ye m 


Corollary 24D (contraposition) T’; 9 Hay iff Ty H my. 


Proof Tsp} my T hpa w 
= I hy mp 
=P sph mp 


by the deduction theorem, 
r, i 
by rule T, 


by modus ponens. 
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(In the second step we use the fact that p => —y tautologically implies 
y-> ay.) By symmetry, the converse holds also. if 


Say that a set of formulas is inconsistent iff for some B, both Band =$ 
are theorems of the set. (In this event, any formula @ is a theorem of the 
set, since B — ~f a is a tautology.) 


Corollary 24E (reductio ad absurdum) If Tso is inconsistent, then 
T} ~ag. 


Proof From the deduction theorem we have I"|- (p-»f) and Tp 
(p-» ~$). And {p —>£p, y— =f} tautologically implies ~p. ` 


EXAMPLE + dx Vyp—> Vy Ixo. 

There are strategic advantages to working backward. 

It suffices to show that dx Vy | Vy Ix g, by the deduction theorem. 

It suffices to show that 3x Vy p H Ix g, by the generalization theorem. 

It suffices to show that ~Vx ~Vy p | ~Vx =p, as this is the same as 
the preceding. 

It suffices to show that Vx ~p |- Vx “Vy, by contraposition (and 
rule T). 
~ It suffices to show that Vx =g | ~Vy¢, by generalization. 

It suffices to show that {Vx ~p, Vy p} is inconsistent, by reductio ad 
absurdum. 

And this is easy: 

1, Wx =p |- ~p by axiom group 2 and modus ponens. 

2. Vyp | @ for the same reason. . 


Lines | and 2 show that {Wx ~p, Vy p} is inconsistent. 


Strategy 


As the preceding example indicates, the generalization and deduction 
theorems (and to a smaller extent the corollaries) will be very useful in 
showing that certain formulas are deducible. But there is still the matter 
of strategy: For a given I” and g, where should one begin in order to show 
that J” |- p? One could, in principle, start enumerating all finite sequences 
of wffs until one encountered a deduction of y from I’. Although this would 
‘be an effective procedure (for reasonable languages) for locating a deduction 
if one exists, it is too inefficient to have more than theoretical interest. 


One technique is to abandon formality and to give in English a proof that _ 


the truth of F implies the truth of p. Then the proof in English can be for- 


a a epee een ce ee pte  eeteeremeetn 
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malized into a legal deduction. (In the coming pages we will see techniques 
for carrying out such a formalization in a reasonably natural way.) 

There are also useful methods based solely on the syntactical form of p. 
Assume then that p is indeed deducible from I’ but that you are seeking 
a proof of this fact. There are several cases: 


1. Suppose that p is (p —> 6). Then it will suffice to show that I’; yp + 0 
(and this will always be possible). 

2, Suppose that p is Vx y. If x does not occur free in I’, then it will suf- 
fice to show that I’ |- y. (Even if x should occur free in I’, the difficulty 
can be circumvented, There will always be a variable y such that I |- Vy vy 
and Vy y% + Vx y. See the re-replacement lemma, Exercise 9.) 

3. Finally, suppose that p is the negation of another formula. 

3a. If p is “(y — 0), then it will suffice to show that |} y and Pj- =0 
(by rule T). And this will always be possible. 

3b. If p is ~y, then of course it will suffice to show that T b wy 

3c, The remaining case is where p is =Y x p. It would suffice to show that — 
I’ |- =ë, where ¢ is substitutable for x in y. (Why?) Unfortunately this is 
not always possible. There are cases in which 


EE axy, 
and yet for every term ¢, 
PY ayy. 


(One such example is = Ø, y = ~(Px > Vy Py).) Contraposition is 
handy here; 


DPjabkaVxy 
iff 
Ls xy j ma. 

(A variation on this is: l; Wyo b-=Wx yp if T; Yay H ~e.) If all else 


fails, one can try reductio ad absurdum. 
EXAMPLE (Q2a) If x does not occur free in œ, then 
| l- (a —> Vx p) <> Vx(a — p). 
To prove this, it suffices (by rule T) to show that 


+ (@— Yx p) > Y x(a > B) 
and 
+ Wx(a — p) -> (æ > Vx p). 
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For the first of these, it suffices (by the deduction and generalization theo- 


rems) to show that 


{(a + Yx p), a} | B. 


But this is easy; Vx p > ĝ is an axiom. 
To obtain the converse, 


H Yxlæ — B) > (a > Vx p), 
it suffices (by the deduction and generalization theorems) to show that 


{Vx(a — B), a} H £. 


This again is easy. 


In the above example we can replace a by —a, B by =f, and use the 
contraposition tautology to obtain the corollary: 


(Q3b) If x does not occur free in a, then 
+ (Ax p - a) + Wx(B - a). 


The reader might want to convince himself that the above formula is valid. 
Frequently an abbreviated style is useful in writing down a proof of 
deducibility, as in the following example. 


EXAMPLE (Eq2) | Vx Vy(x ~ yy = x). 
Proof 

L. Hx æ yax ma xy a x Ax 6. 

2, Fx =x. Ax 5. 


3. Hx = ymy [= x. 1,2;T. 
4, L Wx Vy(x = y= y = x). 3; gen’. 


In line 1, “Ax 6” means that the formula belongs to axiom group 6. 
In line 3, “1, 2; T” means that this line is obtained from lines 1 and 2 by 
rule T. In line 4, “3; gen?” means that the generalization theorem can be 
applied twice to line 3 to yield line 4. In the same spirit we write “MP,” 
“ded,” and “RAA” to refer to modus ponens, the deduction theorem, 
and reductio ad absurdum, respectively. 

It must be emphasized that the four numbered lines above do not con- 
stitute a deduction of Vx VWy(x = y —> y = x). Instead they form a proof 
(in the metalanguage we continue, without justification, to call English) that 
stich a deduction exists. The shortest deduction of Vx Vy(x = y = y = X) 
known to the author is a sequence of seventeen formulas, 
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EXAMPLE | x ~ y—> Wz Pxz—> Wz Pyz. 


Proof 


. H x = y — Pxz—> Pyz, Ax 6. 

. | Yz Pxz = Pxz. Ax 2. 

. Hx =~ y — Wz Pxz = Pyz. 1, 2; T. 

. {x = y, Yz Pxz} | Pyz. 3; MP?. 

. {x =~ y, Yz Pxz} } Wz Pyz. 4; gen. 

. H x = y = Vz Pxz = Vz Pyz. 5; deď?. E 


AUV A WN 


EXAMPLE (Eq5) Let f be a two-place function symbol. Then 
FY Vx Vy Vya = Yi => Xz = Ya => [Xe = JYV). 
Proof Two members of axiom group 6 are 


Xy = Yy > SIX = fX Xg => XXa = JYX, 


Xa = Vo => fX1Xa = JY1Xg > XXa = JY Ya. 
From Vx x ~ x (in axiom group 5) we deduce 
faixa = XiX. 
The three displayed formulas tautologically imply 
Xy = Yy > Xg = Ya => fX Xo = Ma- i 


Exampiy (a) {WV x(Px -> Ox), Wz Pz} |- Qe. It is not hard to show that 
such a deduction exists.’ The deduction itself consists of seven formulas. 

(b) {Vx(Px > Qx), Vz Pz} } Qy. This is just like (a). The point we are 
interested in here is that we can use the same seven-element deduction, with 
c replaced throughout by y. 

(c) {Wx(Px —> Qx), Yz Pz} | Yy Qy. This follows from (b) by the gen- 
eralization theorem. 

(d) {Vx(Px > Ox), Yz Pz} |} Vx Qx. This follows from (c) by use of © 
the fact that Vy Oy | Vx Ox. 
Parts (a) and (b) of the foregoing example illustrate a sort of interchange- 


ability of constant symbols with free variables. This interchangeability is 
the basis for the following variation on the generalization theorem, for 


Which part (c) is an example. Part (d) is covered by Corollary 24G. g% is, 
(Of course, the result of replacing ¢ by y in g. 
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Theorem 24F (generalization on constants) Assume that I} and that 
c is a constant symbol which does not occur in P. Then there is a variable 
-y (which does not occur in g) such that I” |- Wy wg. Furthermore, there is 
a deduction of Wy $ from J" in which ce does not occur. 


Proof Let {@, ..-,@,> be a deduction of » from I. (Thus œ, = p.) 
Let y be the first variable which does not occur in any of the a;’s. We claim 
that 


(«) (af, tae s (%)g> 


is a deduction from I" of pg. So we must check that each (o,)g isin ZU A 
or is obtained from earlier formulas by modus ponens. 


Case ly a, € I’. Then c does not occur in a,. So (œ) = = 0%, which is 
in L, 

Case 2: a, is a logical axiom. Then (œ+) is also a logical axiom. (Read 
the list of logical axioms and note that introducing a new variable will 
transform a logical axiom into another one.) 

Case 3: a, is obtained by modus ponens from œ; and a; (== (œ; > @)) 
for i, j less than k. Then (@)g = ((a,)f > (a,)£). So (a,)¢ is obtained by modus 
ponens from (o,;)f and (a,)g. 


This completes the proof that (+) above is a deduction of pg. Let © be 
the finite subset of J” which is used in (+). Thus (*) is a deduction of pi 
from ®, and y does not occur in ®. So by the generalization theorem, 


® | Vy wg. Furthermore, there is a deduction of Vy pg from © in which 


c does not appear. (For the proof to the generalization theorem did not add 
any new symbols to a deduction.) This is also a deduction from I" of 
Vyp. E 


We will sometimes want to apply this theorem in circumstances in which 


not just any variable will do. In the following version, there i is a variable | 


x selected in advance. 


Corollary 24G Assume that T|- 4, where the constant symbol ¢ does 
not occur in J" or in ø. Then T } Yx g, and there is a deduction of Yx g 
from J’ in which c does not occur. 


Proof By the above theorem we have a deduction (without c) from 
T of Vyd), where y does not occur in yv%. But since c does not occur 
in 9, 

(POs = Py 


SEREA Saeco AE EEN 
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It remains to show that Vy gë | Vx g. We can easily do this if we know 


that 


(Vy pi) +> 


is an axiom. That is, x must be substitutable for y in g, and (p must 
be ø. This is pessonably clear; for details see the re-replacement lemma 
(Exercise 9), E 


Corollary 24H (rule ED Assume that the constant symbol c does not 
occur in p, y, or T, and that 
gb y. 
Then 
Pyaxge ty 


and there is a deduction of y from J"; Jx in which c does not occur. 


Proof By contraposition we have 
| Py ap bogs. 
So by the preceding corollary we obtain 
LP; = + Vx ~g. 
Applying contraposition again, we have the desired result. B 


“BI” stands for “existential instantiation,” a bit of traditional termi- 
nology. 

We will not have occasion to use rule EI in any of our proofs, but it may 
be handy in exercises. It is the formal counterpart to the reasoning: “We 
know there is an x such that _.x__. So call it c. Now from —c— we can 
prove y.” But notice that rule EI does not claim that 3x p | p3, which is 


in fact usually false. 


EXAMPLE, revisited |- dx Vy p—> Wy dx. By the deduction theorem, 


- It suffices to show that 


dx Vy p + Wy dx yp. 
By rule EI it suffices to show that 


Vy oi H Vy dx 9, 
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where c is new to the language. By the generalization theorem it suffices 
to show that 


want <x, y> to be a pair of distinct variables, but often it makes no difference 
whether the pair is <v, ¥)> Or <¥,, 01). 


Vy 9% | Ax @. But when it comes time to substitute a term Z into a formula, then the 
, : choice of quantified variables can make the difference between the sub- 
Since Vy g3 | p4, it suffices to show that stitutability of ¢ and its failure. In this subsection we will discuss what to 
gk Ixo. do when substitutability fails. As will be seen, the difficulty can always be 
surmounted by suitably juggling the quantified variables. 
Vx =~ | =i, H Wx Wy Pxy -> Wy Pyy. 


There is the difficulty that y is not substitutable for x in Wy Pxy, so the 
above sentence is not in axiom group 2. This is a nuisance resulting from 
an unfortunate choice of variables. For example, showing that 


which is trivial (by axiom group 2 and modus:ponens). 


We can now see roughly how our particular list of logical axioms was 
formed. The tautologies were included to handle the sentential connective 
symbols. (We could economize considerably at this point by using only 
some of the tautologies.) Axiom group 2 reflects the intended meaning of 
the quantifier symbol. Then in order to be able to prove the generalization 


By contraposition this is equivalent to For example, suppose we want to show that 
| 
| H Wx Wz Pxz = Wy Pyy 
| 
| 


involves no such difficulties. So we can solve our original problem if we | 


theorem we added axiom groups 3 and 4 and arranged for generalizations | ` know that 
of axioms to be axioms. | L Yx Vy Pxy >» Wx Wz Pxz, 

Axiom groups 5 and 6 will turn out to be just enough to prove the erueml, | 
properties of equality; see the subsection on equality. 7 which, again, involves no difficulties. 

As we will prove in Section 2.5, every logical axiom is a valid formula. | This slightly circuitous strategy (of interpolating Wx Vz Pxz between 
It might seem simpler to use as logical axioms the set of all valid formulas. : ‘Wx Wy Pxy and Vy Pyy) is typical of a certain class of problems. Say that 
But there are two (related) objections to doing this. For one, the notion we desire to substitute a term ¢ for x in a wff o. If ¢ is, in fact, not so sub- 
of validity was defined semantically. That is, the definition referred to stitutable, then we replace Vx@ by Vx g', where’ ¢ is substitutable for x 
possible meanings (i.e., structures) for the language and to the notion of | in y'. In the above example y is Vy Pxy and y' is Wz Pxz. In general g’ 
truth in a structure. For our present purposes (e.g., proving that the valid- | will differ from g only in the choice of quantified variables. But gy’ must be 
ities are effectively enumerable) we need a class A with a finitary, syntactical | formed in a reasonable way so as to be logically equivalent to my. For exam- 
definition. That is, the definition of A involves only matters concerning the | ple, it would be unreasonable to replace Yy Pxy by Wx Pxx, or Yy Wz Oxyz 
arrangement of the symbols in the logical axioms; there is no reference to by Vz Wz Oxzz. 


matters of truth in structures. A second objection to the inclusion of all 
valid formulas as axioms is that we prefer a decidable set A, and the set 
of validities fails to be decidable. 


Theorem 241 (existence of alphabetic variants) Let ọ be a formula, t 
a term, and x a variable. Then we can find a formula gy’ (which differs 
from only in the choice of quantified variables) such that 

(a) pg’ and 9’ fg; 

(b) ¢ is substitutable for x in 9’. 


Alphabetic variants 
Often when we are discussing a formula such as 


Vx(x ae 0 => Ay x = Sy) 


iene mie 


` Proof We consider fixed ¢ and x, and construct gy’ by recursion on 9. 
The first cases are simple: For atomic we take oy’ = p, and then (apy 


we are not interested in the particular choice of the variables x and y. We = (-9'), @ > y) = (> y’). But now consider the choice of (Vy py’. 
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If y does not occur in ¢, or if y = x, then we can just take (Vy p} = Vyq'. 
But for the general case we must change the variable. 

Choose a variable z which does not occur in gy’ or ¢ or x. Then define 
(Vy p) = V2(p')}. To verify that (b) holds, we note that z does not occur 
in ¢ and ¢ is substitutable for x in m’ (by the inductive hypothesis). Hence 
(since x + z) t is also substitutable for x in (p')!. To verify that (a) holds, 
we calculate: 


gk’ by the inductive hypothesis; 
UWP Vg. 
Yyy Hp since z does not occur in 9’; 


Vyp | Vee: 
Vy o H Yz). 


by generalization; 


In the other direction, 


Vip) H ND 
pP Hop 

< Vz) H p; 

c Nzi Yro 


which is g’ by Exercise 9; 
by the inductive hypothesis; 


by generalization. 


The last,step uses the fact that y does not occur free in (y')¥ unless y = z, 
and so does not occur free in Vz(’)¥ in any case. E 


The formulas ¢’ constructed as in the proof of this theorem will be called 
alphabetic variants of œ. The moral of the theorem is: One should not be 
daunted by failure of substitutability; the right alphabetic variant will 
avoid the difficulty. 


Equality 


We list here (assuming that our language includes =) the facts about 
equality which will be needed in the next section. First, the relation defined 
by vı = v, is reflexive, symmetric, and transitive (i.e., is an equivalence 
relation): l 


Eql: j- Vxx = x. 
Proof Axiom group 5. a 


Eq2: H Vx Ypi = y= y = x). 


EEEE EE T moa deena aero rome reenter enone ee ENT eS ES 
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Proof Page 114. 
Eq3: / Vx Vy Yz ~ yoy ~ Z=> X = Z). 
Proof Exercise 11. a 


In addition, we will need to know that equality is compatible with the 
predicate and function symbols: 


Eq4 (for a two-place predicate symbol P): 
| H Wx Wx, Wy Vy = Yi > Xa = Ya PxiXx = Py1y2)- 
Similarly for n-place predicate symbols. 
Proof \t suffices to show that 
{x1 = Jis X2 = Vas Px} H Pyy. 


This is obtained by application of modus ponens to the two members of 
axiom group 6: 
X = Yı > PXK => PVX 
Xa = Ya => PYiXo > PYY E 
Eq5 (for a two-place function symbol f): 
LW Vx Yy Vya = Yy => Xa = Yo > fX1Xa az JYV). 


Similarly for n-place function symbols. 


Proof Page 115. fa 


Final comments 


A logic book in the bootstrap tradition might well begin with this section 
on a deductive calculus. Such a book would first state the logical axioms 
and the rules of inference and would explain that they are acceptable to 


‘reasonable people. Then it would proceed to show that many formulas 


were deducible (or deducible from certain nonlogical axioms such as axioms 


for set theory). 
Our viewpoint is'very different. We study, among other things, the facts 


about the procedure described in the preceding paragraph. And we employ 


in this any correct mathematical reasoning, whether or not such reasoning 


is known to have counterparts in the deductive calculus under study. 


| 
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Formal 
language 
expressions: 


Axioms: 


Deductions: Qe 


Hx(Px —> Wx Px) 


Figure 8. The metalanguage above, in which we study the object language below. 
Figure 8 is intended to illustrate the separation between (a) the level 


at which we carry out our reasoning and prove our results, and (b) the 
level of the deductive calculus which we study. 


EXERCISES 


1. For a term u, let uf be the expression obtained from u by replacing 


the variable x by the term ¢. Restate this definition without using any form 
of the word “replace” or its synonyms. Suggestion: Use recursion on u 


(Observe that from the new definition it is obvious that uf is itself a term.) - 


2. To which axiom groups, if any, do each of the following formulas 


belong? 


IROL Teer err antares 
eoninenrevee=rancannencrenmnnreentecutanmiosiineanmunsnanensbinmtbit 
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(a) (Yx Px > Vy Py) — Pz] [Vx Px > (Wy Py > Pz)). 
(b) Wy[Vx(Px => Px) > (Pe — Pc}. 
(c) Wx Ay Pxy — dy Pyy. 


3. (a) Let Ù be a structure and let s : V — |A |. Define a truth assign- 


‘ment v on the set of prime formulas by 


va) =T iff Foo [s]. 


Show that for any formula (prime or not), 
wa) =T iff Hua [5]. 
(b) Conclude that if I tautologically implies p, then I’ logically implies g. 


4. Give a deduction (from @) of Vx p —> Ax p. (Note that you should 
not merely prove that such a deduction exists. You are instead asked to 


write out the entire deduction.) 


5, Find a function f such that if a formula p has a deduction of length . 


n from a set I’, and if x does not occur free in F, then Wx g has a deduction 
from I of length f(n). The more slowly your function grows, the better. 


6. (a) Show, that if æ — £, then H Vxa—> Vx p. 
(b) Show that it is not in general true that œ => B H Yxa — Wx B. 


7. (a) Show that + Ax(Px -> Wx Px). 
(b) Show that {Qx,Vy(Qy > Wz Pz)} | Wx Px. 


8. (Q2b) Assume that x does not occur free in a Show that 
b (a> Ax p) > Ixa —> P). 
As a corollary conclude that, under the same assumption, we have Q3a: 
L (Wx Bra) > Ax(B > a). 


9, (Re-replacement lemma) (a) Show by example that (yp) is not in 
general equal to p. And that it is possible both for x to occur in (py) at 
a place where it does not occur in p, and for x to occur in ọ at a place where 


it does not occur in (py)s- 
(b) Show that if y does not occur at all in p, then x is substitutable for 


y in gë and (p4 = p. Suggestion: Use induction on g. 


10. Show that 
Vx Vy Pxy | Vy Vx Pyx. 
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11. (Eq3) Show that 
H Wx Wy Vz(x = y => y = z -> x ~ z). 


12. Show that any consistent set T of formulas can be extended to a 
consistent set A having the property that for any formula a, either a € 4 
or (a) € A. (Assume that the language is countable. Do not use the 
compactness theorem of sentential logic.) 


13. Form a deductive calculus for sentential logic by taking’ as logical 
axioms the set of tautologies, and taking modus ponens as the rule of in- 
ference. Show that for a set I" ; œ of wffs of sentential logic, œ is a theorem of 
I’ iff I tautologically implies œ. 


14. Show that | Py <> Vx(x = y => Px). 


15. Show that deductions (from @) of the following formulas exist: 
(a) Ax(a v p) + Ixa v Fxg. 

(b) Vxav Vx B-> Vx(a v p). 

(c) Ax(a a p) —> Ixa a Fxg. 

(d) Vx(a—» B) + (Ax a — Ix p). 

(e) Ax(Py A Qx) <> Py a Ax Ox. 


§2.5 SOUNDNESS AND COMPLETENESS THEOREMS 


In this section we. establish two major theorems: the soundness of our 
deductive calculus (I | p = I’ H g) and its completeness LEp=I} 9g). 
We will then be able to draw a number of interesting conclusions (including 

-the compactness and enumeration theorems), Although our deductive 
calculus was chosen in a somewhat arbitrary way, the significant fact is 
that some such deductive calculus is sound and complete. This should be 
encouraging to the “working mathematician” concerned about the existence 
of proofs from axioms; see the Retrospectus subsection of Section 2.6. 


Soundness Theorem If IH o, then I Fg. 


The idea of the proof is that the logical axioms are logically implied 
by anything, and that modus ponens preserves logical implications. 


Lemma 25A Every logical axiom is valid: 


Proof of the soundness theorem, assuming the lemma We show by induc- 
tion that any formula œ deducible from I is logically implied by I’ 


otherness 
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Case 1: p is a logical axiom. Then by the lemma | g, so a fortiori I E ¢. 

Case 2: p e I’. Then clearly T 9. 

Case 3: y is obtained by modus ponens from y and y => p, where (by 
the inductive hypothesis) I y and T F (y => p). It then follows at once 
that I Hø (cf. Exercise 3 of the preceding section). E 


It remains, of course, to prove that lemma. We know from Exercise 6 
of Section 2.2 that any generalization of a valid formula is valid. So ‘it 
suffices to consider only logical axioms which are not themselves general- 
izations of other axioms. We will examine the various axiom groups in 


- order of complexity. 


Axiom group 3: See Exercise 3 of Section 2.2. 

Axiom group 4: See Exercise 4 of Section 2.2. l 

Axiom group 5: Trivial. Ų satisfies x = x with s iff s(x) = s(x), which 
is always true. l i 

Axiom group 1: We know from Exercise 3 of the preceding section that 
if Ø tautologically implies œ, then @ F œ. And that is just what we need. 

Axiom group 6 (for an example, see Exercise 5 of Section 2.2.): Assume 
that æ is atomic and a’ is obtained from œ by replacing x at some places by 
y. It suffices to show that 


{x = y, a} Fa’. 
So take any YW, s such that , 


Eq x = y fs], ie., s(x) = s(y). 


Then any term ¢ has the property that if ¢’ is obtained from ¢ by replacing 
x at some places by p, then 5(¢) = S(t’). This is obvious; a full proof would 
use induction on ft. l l 

If œ is f ~ t, then c’ must be ti ~ t} where £; is obtained from t; as 
described: 


Faas] iff S) =S), 
iff SC) = 5h), 
iff Fao’ [s]. 
Similarly, if œ is Pf, +++ fn, then æ’ is Pry ++- tn and an analogous argument 


. applies. 


Finally, we come to axiom group 2. It will be helpful to consider first 
a simple example; we will show that Vx Px — Pt is valid. Assume that 


Fg Vx Px [s]. 
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Then for any d in |% |, 


Fu Px [s(x|d)]. 
So in particular we may take d = &(t): | 
(a) Fu Px [s(x|5@))]. 


This is equivalent (by the definition of satisfaction of atomic formulas) to 
S(t) € PY, 

which in turn is equivalent to 

(b) Fu Pt [s]. 


For this argument to be applicable to the nonatomic case, we need a 
way of passing from (a) to (b). This will be provided by the substitution 
lemma below, which states that 


Fu p [s(x]5@))] 


whenever ¢ is substitutable for x in g. 


Consider a fixed Ù and s. For any term u, let už be the result of replacing 
the variable x in u by the term ¢, 


iff Fag? [s] 


Lemma 25B (u7) = EEO 


. This asserts that a substitution can be carried out either in the term u 
or in s, with equivalent results. The corresponding diagram is 


substitution | 
terms + terms 
of t for x 
$(x|5(2)) Š 
RI 


Proof By induction on the term u. If u is a constant symbol or a vari- 
able other than x, then uf = u and the desired equation reduces to 5(u) = 
5(u). If u = x, then the equation reduces to s(t) = 5(t). The inductive step, 
although cumbersome to write, is mathematically trivial. ia 


The substitution lemma is similar in spirit; it states that a substitution 
can be carried out either within ø or in s, with equivalent results. For an 
example see Exercise 10 of Section 2.2. 
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Substitution Lemma If the term ¢ is substitutable for the variable x 
in the wff y, then | 
Eu gels] if Fag isle. 

Proof We use induction on ¢ to show that the above holds for every s. 


Case 1: p is atomic. Then the conclusion follows from the preceding 
lemma. For example, if p is Pu for some term u, then 
ku Pu [s] iff 5G) © PY, 
iff s(x|5(2))(u) € PY 


iff Hy Pu [o(x|5())]. 


by Lemma 25B, 


Case 2: p is =y or p —= 0. Then the conclusion for p follows at once 
from the inductive hypotheses for y and 0. 

Case 3: œ is Vyp, and x does not occur -free in p. Then s and 5(x|5(t)) 
agree on all variables which occur free in p. And also gj is just p. So the 
conclusion is immediate. 

Case 4: p is Vy y, and x does occur free in p. Because ¢ is substitutable 
for x in p, we know that y does not occur in ¢ and ¢ is substitutable for x 
in y (see the definition of “substitutable”’). By the first of these, 


(2) a) = sD) 
for any d in |Ù |. Since x Ay, pë = Vy pr. 
Ew of ls] it’ for every d, Fu vf [s01 

iff for every d, Fuy BOIAN by the inductive hypo- 
thesis and (+); | 

iff Ha p lO. 


So by induction the lemma holds for all g. 


Axiom group 2: Assume that t is substitutable for x in g. Assume that Ñ 
satisfies Vx p with s. We need to show that Hy p? Is]. We know that for any 
d in |X], 
g Eor p [s(x|@)], 


Th particular, let d = S(t): 
Ha p [s(x] 5(4))). 
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So, by the substitution lemma, 


Fa p? ls]. 


Hence Vx p — gf is valid. 


This completes the proof that all logical axioms are valid. And so the 
soundness theorem is proved. 


Corollary 25C If Hepy), then p and y are logically equivalent. 


Corollary 25D If q’ is an alphabetic variant of y (see Theorem 241), 
then » and q’ are logically equivalent. . 


Recall that a set I’ is consistent iff there is no formula y such that both 


Ipo and lH ~ag. Define I to be satisfiable iff there is some Ñ and s 
such that Ñ satisfies every member of I with s. 


Corollary 25E If J’ is satisfiable, then T is consistent. 


This corollary is actually equivalent to the soundness theorem, as the 
reader is invited to verify. 


The completeness theorem is the converse to the soundness theorem 
and is a deeper result. 


Completeness Theorem (Gödel, 1930) 


(a) lf CE, then TH- g. 
(b) Any consistent set of formulas is satisfiable. 


Actually parts (a) and (b) are equivalent; cf. Exercise 2. So it suffices 
to prove part (b). We will give a proof for a countable language; later we 
will indicate what alterations are needed for languages of larger estdinabty.: 
(A countable language is one with countably many symbols, or equivalently 
(by Theorem OB) one with countably many wiffs.) 

The ideas of the proof are related to thgse in the proof of the compact- 
ness theorem for sentential logic. We begin with a consistent set J. In 
steps 1-3 we extend J’ to a set A of formulas for which . 


areca. 


l (ii) A is consistent and is maximal in tle sense that for any formula @ 
either aE A or (ma) € A. 


(iii) For any formula g and variable x, there is a c such that 


(Yx pg => ag?) € A. 
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Then in step 4 we form a structure AÑ in which members of J" not containing 
= can be satisfied. |Ù | is the set of terms, and for a predicate symbol P, 


| ltic p ERT iff Ph e t,€ A. 


Finally, in step 5 we change Ñ to accommodate formulas containing the 
equality symbol. 

It is suggested that on a first reading the details which are provided for 
most of the steps be omitted. Once the outline is clearly in mind, the entire 
proof should be read. (The nondetails are marked with a stripe in the left 


margin.) 
Proof Let I’ be a consistent set of wffs in a countable language. 


Step 1: Expand the language by adding a countably infinite ‘set of 
| new constant symbols. Then T remains consistent as a set of wffs in 
the new language. 


Details: If not then for some £, there is a deduction (in the expanded 
language) of (Ba =p) from T. This deduction contains only finitely © 


many of the new constant symbols. By the theorem for generalization on 
constants (Theorem 24F), each can be replaced by a variable. We then have 
a deduction (in the original language) of (B' A mB") from T'. This contradicts 
our assumption that J” was consistent. i i 

Step 2: For each wff p (in, the new language) and each variable 
x, we want to add to T the wif 


“VX D> 9%, 


where c is one of the new constant symbols. (The idea is that c volun- 
teers to provide a counterexample to 9, if there is any.) We can do 
this in such a way that J" together with the set O of all the added wffs 


is still a consistent set. 


Details: Adopt a fixed enumeration of the pairs <p, x>, where p is a wif 
(of the expanded language) and x is a variable: 


Ps ’ XD» Pa , X» LP, X3> tabi! Ss 
This is. possible since the language is countable. Let 6, be 
AV xy Pi > ayy > 


where c, is the first of the new constant symbols not occurring in pı. Then 
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go on to <Pz, X> and define 0,. In general, 6,, is 


WX) Pn => “Pn oh > 


where c, is the first of the new constant symbols not occurring in p, or in 
Oy; for any k <n. 

Let © be the set {6,, 03, 
then (because deductions are finite) for some m > 0, 


Lu {,, reas Ons Oma} 
is inconsistent. Take the least such m. Then by RAA 


Pu {9,, mney On? }- BUE l 
Now Omp is 
=~ x p => p7 
for some x, p, and c. So by rule T, we obtain the two facts: 
Pu {),, PAP ay Din} |- AV XD, 
Lu {0;, cue O gh Hoz. 


Since ¢ does not appear in any formula on the left side, we can apply the 
Corollary 24G to the second of these, obtaining 


PU {0,, 0.5 Om} EW. 


This and (+) contradict the leastness of m (or the consistency of J’, if 
m= 0), - 


(*) 


Step 3: We now extend the consistent set I U @ to a consistent set 
A which is maximal in the sense that for any wff ọ either p e A or 


(mg) € A. 


\ 

Details: We can imitate the proof used at the analogous place in the 
proof of sentential compactness in Section 1.7. Or we can argue as follows: 
Let A be the set of logical axioms for the expanded language. Since P U O 


is consistent, there is no formula £ such that 7 U © U A tautologically _ 


implies both 8 and =$. (This is by Theorem 24B; the compactness theorem 
of sentential logic is here used.) Hence there is a truth assignment v for the 
set of all prime formulas which satisfies (every member of) l U © u A. 
Let , 

. A = {p : Kp) = T}. 


.}. We claim that J’ U @ is consistent. If not, . | 
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Clearly for any » either p e A or (~p) €e A but not both. Also we have 


A|-p =A tautologically implies p l 
= vy) =T 


(since A S A), 
since v satisfies A, 
=p EA. 


Consequently, A is consistent, lest both » and (~y) belong to 4. 


| Actually, regardless of how 4 is constructed, it must be deductively 
| closed. For 

| AH p= A k nagp by consistency, 
| 

| 


=p EA by maximality. 

Step 4: We now make from J a structure Ñ for the new language, 
but with the equality symbol (if any) replaced by a new two-place 
predicate symbol Æ. Y will not itself be the structure in which J” will 
be satisfied but will be a preliminary structure. 


(a) |W |:= the set of all terms of the new language. 
(b) Define the binary relation E% by 


<u, t) € EM iff the formula u ~ z belongs to A. 


(c) For each n-place predicate parameter P, define the n-ary rela- 
tion P% by 


Chs s bp E PY iff Phe t, € A. 


(d) For each v-place function symbol f, let f% be the function defined 
i l by 
SM, sey tn) = fh rte lns 


‘This includes the n = 0 case; for a constant symbol cwe take c = c. 
Define also a function s:V¥—|%|, namely the identity function 
on V. : re wn 

It then follows that for any term f, a(t) = ¢. For any wif g, let o* 
be the result of replacing the equality symbol in » by E. Then 


Fo p* [s] - 


Details: That 5(#) = ¢ can be proved by induction on f, but the proof 
18 entirely straightforward. . 


iff ped. 


132 2. FIRST-ORDER LOGIC 2.5 Soundness and Completeness Theorems 133 
The other claim, that 


Ey p* [s] iff pe A, 


(The notational ambiguity is harmless since Vx(p*) is the same as (Wx ¢)*.) 
A includes the wf 0: 


“Wx p > 9%. 


we prove by induction on the number of places at which connective or To show that 


quantifier symbols appear in 9. -Fu Vx g* [s] = Vxp E A, 


we can argue: If p* is true of everything, then it is true of c, whence by the 
inductive hypothesis y3 e 4. But then Vx p € 4, because c was chosen to 
be a counterexample to ¢ if there was one. In more detail: 


Ey Vx p* [s] => Eu p* islo 


Case 1: ‘Atomic formulas. We defined Y in such a way as to make this 
case immediate. For example, if p is Pt, then 


Fu Pt[s] iff s(t) © PY, 
iff te PY, 


if Pre A => Eq (p*) [s] by the substitution lemma 
Similarly => Fg (p2)* [s], this being the same formula 
Hu vEt [s] iff C5(w), S(O) € BY, AAT by the tadlicuve hypothesis 
| => (~g) A by consistency 


iff <u, D € EY, 
iff u= ted. 


=> (Vx 9) ¢ A since 0 € A and A is deductively closed 


=> Vxp EÁ. 
Case 2: Negation. 


Fa(—p)* s] iff Fap” [s], 
; iff p¢ A by inductive hypothesis, 


(This is our only use of ©. We needed to know that if (Vx p) € A, then 
for a particular c we would have (79%) eA.) 
We turn now to the converse. We can almost argue as follows: 


iff (-y)e 4 by properties of A. Ku Wx p* [s] => For p* [s(x]2)] for some ¢ 


Case 3: Conditional. a> Ly (p7)* is] by the substitution lemma 
ie => EA by the inductive hypothesis 

i= % ff * * 

RO CEG: eee =a VxpEA since A is deductively closed... 


if pgA or ped by inductive hypothesis, 


iff (—p)eA or yed, The flaw here is that the two wavy implications require that ¢ be substi- - 
tutable for x in p. This may not be the case, but we can use the usual repair: 
We change to an alphabetic variant y of pọ in which £ is subsist for x. 


‘Then 


Ky Wx p* [s] => Ku p* [s@|2)] for some 4, Ganeerorth fixed 
=> Ky y* isl by the semantical equivalence of alpha- 
betic variants (Corollary 25D) 


=>Alk(p—>y), in fact tautologically, 
=p¢ń or [peá and A} yl, 
=>(—p)Ed or ped, 


which closes the loop. And 


AF@>») iff pyped. => Ly (yi* [s] by the substitution lemma 
(This should be compared with Exercise 2 of Section 1.7.) Kane by the inductive hypothesis 
Case 4: Quantification. We want to show that =Vxyg A since A is deductively closed 
=>Vxøg A by the syntactical equivalence of alpha- 


Ea Vx p* [s] iff Yxp e. betic variants (Theorem 241). 


| 
| 
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This completes the list of possible cases; it now follows by induction th 


for any 9, 


Fugt is] iff pe A. 

` If our original language did not include the equality symbol, then 
we are done. For we need only restrict Ù% to the original language to 
obtain a structure which satisfies every member of I" with the identity 


function. But now assume that the equality symbol is in the language. : 


Then Y will no longer serve. For example, if J” contains the sentence 
c = d (where c and d are distinct constant symbols), then we need 
a structure B in which c® = d”, We obtain 8 as the quotient structure 
W/E of Ù modulo EX, 


Step 5: E% is an équivalence relation on | % |. For each ¢ in |N] 
‘let [¢] be its equivalence class. Æ% is, in fact, a congruence relation for 
A. This means that the following conditions are met: 


(i) E" is an equivalence relation on | % |. 
(ii) P" is compatible with E% for each predicate symbol P: 


Clos eot EP% and GE% for i<n=>dt,..., td © P% 


(iii) f” is compatible with Æ% for each function symbol f: 
GE for ix n= fey, ...,t,) ESSU, 4). 


Under these circumstances we can form the quotient structure W/E, 
defined as follows: 


(a) | N/E | is the set of all equivalence classes of members of |W |. 
(b) For each n-place predicate symbol P, 


<ia .. «5 [tld E PYE iff <t4,,...,¢,>¢ PY 
(c) For each n-place function symbol F 
SPA, oeo D = 1s 2255 Bb 
This includes the n = 0 cases: 
CWE — fe, 


Let h:|U|—+|W/E| be the natural map: 


A(t) = [r]. 
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Then h is a homomorphism of X onto W/E. Furthermore, E /E is the 

equality relation on | M/E |. Consequently, for any 7: 


p E A <+> ug Is] 
<> Foye P* [hos] 
<> Faz p [ho s]. 


So W/E satisfies every member of A (and hence every member of I’) 


with hos. 
Details: Recall that 
(EY! i @~ te, 
if Abr tl. 


| ies Eql, Bq2, and Eq3 
(i) E% is an equivalence relation on N by properties Eql, Eq 


of equality. 
i PX is compatible with EX by property Eq4 of o 
Gii) f% is compatible with E% by property Eq5 of equality. 
j W/E i 
Tt then follows from the compatibility of PY with EY that a a i 
defined. Similarly, /%Z is well defined because f% is compati Rai a 
It is immediate from the construction that h is a homomorphism 


onto U/E. And 


MEXE e ik EY, 
iff = [e]. 
Finally, 
pede Hy o* [s] by step 4 


i em 
<> Hyg p* [hos] by the homomorphism theor 


ES kum p [hos], 


the last step being justified by the fact that EWE js the equality relation on 


| UE |. 


Step 6: Rest 
restriction of U/E satisfies every membe 


rict the structure W/E to the original language. This 
r of T with hos. A 


a few modifications to the foregoing proof 


For an uncountable language, e language has car- 


ded. Say that th 
` of the completeness. theorem are nee 
dinelit x a this we mean that it has x symbols or, rae RA ae 
ase We will describe the modifications needed, assuming the reader 
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a substantial knowledge of set theory. In step 1 we add x new constant 
symbols; the details remain unchanged. In step 2, only the details change, 
The cardinal is an initial ordinal. (We have tacitly well-ordered the lan- 
guage here.) “Enumerate” the pairs 


Pu > XaJa<u 
indexed by ordinals less than x. For œ < x, 0, is 
AW Xa Pa —> (A , 


where c, is the first of the new constant symbols not in », or in 0; for any 
B <a. (This excludes at most $, - card(a) constant symbols, so there are 
some left.) Finally, in step 3, we can obtain the maximal set A by use of 
Zorn’s lemma. The rest of the proof remains unchanged. 


Compaciness Theorem (a) If I’, then for some finite P, < I" we 
have I) Fo. 


(b) If every finite subset D, of I” is satisfiable, then I” is satisfiable. 


In particular, a set X of sentences has a model iff every finite subset has 
a model, 


To prove part (a) of the compactness theorem, we simply observe that 


TPEgsI po 
=> loho for some finite I, ¢ I, deductions being finite 
=> ly Eg. 


Part (b) has a similar proof. If every finite subset of T'is satisfiable, then 
by soundness every finite subset of I is consistent. Thus I is consistent, 
since deductions are finite. So by completeness, I is satisfiable. (Actually 
parts (a) and (b) are equivalent; cf. Exercise 3 of Section 1.7.) E 


When a person first hears of the compactness theorem, his natural incli- 
nation is to try to combine (by some algebraic or set-theoretic operation) 
the structures in which the various finite subsets are satisfied, in such 2 
way as to obtain a structure in which the entire set is satisfied. In fact, 
such a proof is possible; the operation to use is the u/traproduet construction. 
But we will refrain from digressing further into this intriguing possibility. 

Notice that the compactness theorem involves only semantical notions 
of Section 2.3; it does not involve deductions at all. And there are proofs 
that avoid deductions. The same remarks apply to the following theorem. 
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*Enumerability Theorem For a reasonable language, the set of valid 


wfis can be effectively enumerated. 


By a reasonable language we mean one whose set of parameters can be 


effectively enumerated and such that the two relations 


{(P, n) : P is an n-place predicate symbol} 


and; 


{(f n): f is an n-place function symbol} 


are decidable. For example, any language with only finitely many param- 
eters is certainly reasonable. On the other hand, a reasonable language 
‘must be countable, since we cannot effectively enumerate an uncountable 


set, 


A precise version of this theorem will be given in Section 3.4. (See es- 


pecially item 20 there.) The proofs of the two versions are in essence the 
same, 


Proof The essential fact is that A, and hence the set of deductions, are 


decidable. 


Suppose that we are given some expression €. (The assumption of reason- 
ableness enters already here. There are only countably many things eligible 
to be given by one person to another.) We want to decide whether or not 
eis in A. First we check that e has the syntactical form necessary to be a 
formula. (For sentential logic we gave detailed instructions for such a check; 
see Section 1.4. Similar instructions can be given for first-order languages, 
by using Section 2.3.) If e passes that test, we then check (by constructing 
a truth table) to see if ¢ is a generalization of a tautology. If not, then we 


proceed to see if ¢ has the syntactical form necessary to be in axiom group 2. 


And so forth. If s has not been accepted by the time we finish with axiom 
group 6, then e is not in A. 

(The above is intended to convince the reader that he really can tell 
members of A from nonmembers. The reader who remains dubious can 
look forward to the rerun in Section 3.4.) l i 

Since A is decidable, the set of tautological consequences of A are effec- 
tively enumerable; see Theorem 17G. But 


{a :a is a tautological consequence of A} 
= {a : | a} by Theorem 24B, 
= {a :a is valid}. B 
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An alternative to the last paragraph of this proof is the following argu 
ment, which is possibly more illuminating: First we claim that the set of 
deductions (from @) is decidable. For given a finite sequence a, ...,a, 
we can examine each a, in turn to see if it is in A or is obtainable by modus 
„ponens from earlier members of the sequence. Then to enumerate the va- 
. lidities, we begin by enumerating all finite sequences of wffs. We look at 
each sequence as it is produced and decide whether or not it is a deduction, 
If not, we discard it. But if it is, then we put its last member on the list of 
validities. Continuing in this way, we generate a list on which any valid 
formula will eventually appear. 


*Corollary 25F Let J‘ be a decidable set of formulas in a reasonable 
language. 

(a) The set of theorems of I" is effectively enumerable. 

(b) The set {p : T Hp} of formulas logically implied by I’ is effectively 
enumerable. 


(Of course parts (a) and (b) refer to the same set. This corollary includes 
the enumeration theorem itself, in which T = Ø.) 


Proof 1 Enumerate the validities; whenever you find one of the form 


Ay mr see => Oy => Ay, 


check to see ifa,,..., a, are in I’. If so, then put a, on the list of theorems 
of I’. In this way, any theorem of J’ is eventually listed. E 


Proof 2 T O A is decidable, so its set of tautological consequences is 
effectively enumerable. And that is just the set we want. a 


For example, let I’ be the (decidable) set of axioms for any of the usual 
systems of set theory. Then this corollary tells us that the set of theorems 
of set theory is effectively enumerable. 


*Corollary 25G Assume that I" is a decidable set of formulas in 4 
reasonable language, and for any sentence o either P Ho or P pH 7. 
Then the set of sentences implied by I" is decidable. 


Proof Jf I is inconsistent, then we have just the (decidable) set of al 
sentences, So assume that J’ is consistent. Suppose that we are given 4 
sentence ø and asked to decide whether or not P } ø. We can enumerati 
the theorems of I’ and look for o or =ø. Eventually one will appear, and 
then we know the answer. [A 


2.5 Soundness and Completeness Theorems 139 


(Observe that this proof actually describes two decision procedures. 
One is correct when I is inconsistent, the other is correct when I’ is con- 
sistent. So in either case there is a decision procedure. But we cannot 
necessarily determine effectively, given a finite description of I’, which one 
is to be used.) 

It should: be remarked that our proofs of enumerability cannot, in gen- 
eral, be strengthened to proofs of decidability. For almost all languages 
the set of validities is not decidable. (See Church’s theorem, Section 3,5.) 


Historical notes 


The completeness theorem (for countable languages) was contained in 
the 1930 doctoral dissertation of Kurt Gédel. (It is not to be confused 
with the “Gödel incompleteness theorem,” published in 1931. We will 
consider this latter result in Chapter 3.) The compactness theorem (for 
countable languages) was given as a corollary. 

The compactness theorem for uncountable languages was implicit in a 
1936 paper by Anatolii Mal’cev. His proof used Skolem functions (cf. . 
Section 4.2) and the compactness theorem of sentential logic. The first 
explicit statement of the compactness theorem for uncountable languages. 


was in a 1941 paper by Malcev. 


The enumerability theorem, as well as following from Gödels 1930 
work, was also implicit in results published in 1928 by Thoralf Skolem. 

The proof we have given for the completeness theorem is patterned after 
one given by Leon Henkin in his dissertation, published in 1949. Unlike 
Gédel’s original proof, Henkin’s proof generalizes easily to languages of 
any cardinality. 


ë EXERCISES 


i. (Semantical rule EI) Assume that the constant symbol c does not 
occur in p, p, or I’, and that I’; g} Hy. Show (without using the soundness 
and completeness theorems) that J”; 4x Fy. 


2. Prove the equivalence of parts (a) and (b) of the completeness 
theorem. 


3. Assume that I|- p and that P is a predicate symbol which occurs 
neither in J" nor in g. Is there a deduction of g from J” in which P nowhere 
occurs? 


4. Assume that the language has only finitely many parameters. | 
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(a) Show that we can effectively decide, given a finite structure Y anda 
sentence o, whether or not Fy o. 

(b) Show that the set of sentences having finite models is effectively 
enumerable. 


5. Assume that the language has only finitely many parameters. 


(a) Let X be a set of sentences such that for any o € X, if o has a coun- 
terexample (i.e., an Ñ in which ø is false), then o has such a counterexample 
with |Y | finite. Find an effective procedure which, given any o € 2, will 
decide whether or not o is valid. 

(b) Show that the set of valid Y, sentences’ without function symbols is 
decidable. (See Exercises 18 and 19 of Section 2.2 for terminology and 
background. An V, formula is one of the form Vx, +--+ Vx pp, where p 
is existential.) 


6. Let [= {aV», Pv,, Pr,, Pv,, Pg, .. 
isfiable? 


.}. Is I” consistent? Is J” sat- 


§2.6 MODELS OF THEORIES 


In this section we will leave behind deductions and logical axioms. Instead 
we return to topics discussed in Section 2.2. But now, in the presence of 
the theorems of the preceding section, we will be able to answer more ques- 
tions than we could before. 


Size of models 


In the completeness theorem we started with a set J’ (in a language of 
cardinality! x) and formed a structure Ñ/E in which it was satisfied. N/E 
was constructed from a preliminary structure Ñ. The universe. of Ù was 
the set of all terms in the language obtained by adding x new constant 
symbols. So clearly || contained at least x terms. On the other hand, 
there are only x expressions in the augmented language (by Theorem OD), 
so |X| could have no more than x terms. Thus the cardinality of X (by 
which we mean the cardinality of | |) was x. 

The universe of U/E consisted of equivalence classes of members of %, 
so card |U/E| <card|%|. (We can map |Y/E| one-to-one into |A] 
by assigning to each equivalence class some chosen member.) Thus, when 
the smoke had cleared, J' was satisfied in a structure I/F of cardinality <% 


1 The reader who wishes to avoid uncountable cardinals is advised to skip from part 
(a) of the Léwenheim-Skolem theorem to Theorem 26B. 


| 
| 
| 
| 
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Liwenheim-Skolem Theorem (1915) (a) Let J’ be a satisfiable set of 
formulas in a countable language. Then JI is satisfiable in some countable 
structure. 

(b) Let I’ be a satisfiable set of formulas in a language of cardinality ~. 
Then J” is satisfiable in some structure of cardinality < x. 


Proof (a) is a special case of (b), wherein x = $. To prove (b), first 
observe that I is consistent, by the soundness theorem. Then by the com- 
pleteness theorem (plus the foregoing remarks) it can be satisfied in a struc- 
ture of cardinality < x. a 


(There is another, more direct proof of this theorem that will be indicated 
in Section 4.2; see especially Exercise 1 there. That proof, which does not 
use a deductive calculus, begins with an arbitrary structure Ù in which I" 
can be satisfied, and by various manipulations extracts from it a suitable 
substructure of cardinality + or less.) 

The Léwenheim—Skolem theorem was published by Leopold Léwenheim 
in 1915 for the case where J” is a singleton; Thoralf Skolem in 1920 ex- | 
tended this to a possibly infinite T. The theorem marked a new phase 
in mathematical logic. Earlier work had been done in the direction of for- 
malizing mathematics by means of formal languages and deductive calculi; 
this work was initiated largely by Gottlob Frege in 1879. For example, 
the Principia Mathematica (1910-1913) of Whitehead and Russell carried out 
such a formalization in great detail. But the modern phase began when 
logicians stepped back and began to prove results about the formal systems 
they had been constructing. Other early work in this trend was done by 
Kurt Gédel (as mentioned before), Alfred Tarski, and others. 

For a sample application of the Lowenheim-Skolem theorem, let Asp 
be your favorite set of axioms for set theory. Hopefully these axioms are 
consistent. And so they have some model. By the Lowenheim-Skolem 
theorem, the axioms have a countable model ©. Of course, G is also a 
model of all the sentences logically implied by Asr. One of these sentences 
asserts (when translated back into English according to the intended 
translation) that there are uncountably many sets. There is no contradiction 
here; but the situation is sufficiently puzzling to be called “Skolem’s para- 
dox.” It is true that in the structure © there is no point which satisfies the 
formal definition of being a one-to-one map of the natural numbers onto 
the universe. But this in no way excludes the possibility of there being 
(outside ©) some real function providing such a one-to-one correspondence. 

For a structure Ñ, define the theory of X, Th, to be the set of all sen- 
tences true in Ñ, Suppose that we have an uncountable structure Ñ for a 
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countable language. By the Léwenheim-Skolem theorem (applied to Th W 
there is a countable 8 which is a model of Th N. It follows that Y% = 
for 
kyo => o e Th > Epo 
and 
Kyo = by me = (mo) e ThA = Fg no. 


Conversely, suppose that we start with a countable structure %. Is there 
an uncountable Ñ such that X = B? If V is finite (and the language in- 
cludes equality), then this is impossible. But if % is infinite, then there will 
be such an A, by the following “upward and downward Léwenheim- 
Skolem theorem.” The upward part is due to Tarski, whence the “T” of 
“LST.” ; 


LST Theorem Let I’ be a set of formulas in a language of cardinality 
x, and assume that J” is satisfiable in some infinite structure. Then for every 
cardinal A > x, there is a structure of cardinality 2 in which J” is satisfiable. 


Proof Let % be the infinite structure in which J” is satisfiable. Expand 
the language by adding a set C of A new constant symbols. Let 


X = {c AY Ca : Cy, Cg distinct members of C}. 


Then every finite subset of X U T is satisfiable in the structure A, expanded 
to assign distinct objects to the finitely many new constant symbols in the 
subset. (Since % is infinite, there is room to accommodate any finite number 
of these.) So by compactness X U T'is satisfiable, and by the L6wenheim- 
Skolem theorem it is satisfiable in a structure 8 of cardinality < A. (The 
expanded language has cardinality x + A = 2.) But any model of X clearly 
has cardinality > 4. So % has cardinality 2; restrict 8 to the original lan- 
guage. E 


Corollary 26A (a) Let 2 be a set of sentences in a countable language. 
If X has some infinite model, then X has models of every infinite cardinality. 
(b) Let Ù be an infinite structure for a countable language. Then for any 
infinite cardinal A, there is a structure % of cardinality A such that B= Y. 


Proof (a) Take T'= Z, x= N, in the theorem. (b) Take X= Th 
in part (a). El 


| 
| 
| 
| 
| 


Consider a set X of sentences, to be thought of as nonlogical axioms. 
(For example, Z might be a set of axioms for set theory or a set of axioms 
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for number theory.) Call X categorical iff any two models of X are isomor- 


phic. The above corollary implies that if 2 has any infinite models, then 2 
is not categorical. There is, for example, no set of sentences whose models 
are exactly the structures isomorphic to (N, 0, S, +, +). This is indicative 
of a limitation in the expressiveness of first-order languages. (As will be 
seen in Section 4.1, there are categorical second-order sentences. But sec- 
ond-order sentences are peculiar objects, obtained at the cost of holding 
the notion of subset fixed, immune from interpretation by structures.) 

It is possible to have sentences having only finite models. For example, 
any model of Vx Vy x = y has cardinality 1. But if all models of 2’ are 
finite, then there is a finite bound on the size of the models, by the following 
theorem. 


Theorem 26B Ifa set X of sentences has arbitrarily large finite models, 
then it has an infinite model. 


Proof For each integer k > 2, we can find a sentence A, which translates, 
“There are at least k things.” For example, 


Ay, = Av, Av, 01 AY Pos 
Ag = Av, Av, Avalo, FY Pa A Pa AL Vs A Vi AY Va). 


Consider the set 


Oe T Me EAE 


By hypothesis any finite subset has a model. So by compactness the entire 
set has a model, which clearly must be infinite. a 


Both the proof to this theorem and the proof of the preceding theorem 
illustrate a useful method’ for obtaining a structure with given properties. 
One writes down sentences (possibly in an expanded language) stating the 
properties one wants. One then argues that any finite subset of the sentences 
has a model. The compactness theorem does ihe rest. We will see more 
examples of this method in the coming pages. 


Corollary 26C The class of all finite structures (for a fixed language) 
is not.-EC,. The class of all infinite structures is not EC. 


Proof The first sentence follows immediately from the theorem. If the 
class of all infinite structures is Mod z, then the class of all finite structures 
is Mod =r. But this class isn’t even EC, much less EC. a 
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The class of infinite structures is EC,, being Mod{A,, As, ...}. 

For example, it is a priori conceivable that there might be some ver 
subtle equation of group theory that was true in every finite group but fal 
in every infinite group. But by the above theorem, no such equation exists, 


EXAMPLE Consider the structure 
R= (N, 0, S, <, +, *). 


By the LST theorem there are structures elementarily equivalent to 9 
which are uncountable (and hence not isomorphic to $). We claim now 
that there is also a countable structure M, elementarily equivalent to 
but not isomorphic to, the structure , 


Proof Expand the language by adding a new constant symbol e. Let 
X= {0 <c, S0<c, SS0 <c, alta te 


We claim that £ U Th® has a model. For consider a finite subset. 
That finite subset is true in l 


N, = (N, 0, S, <, +, +, k) 


(where k = c®) for some large k. So by the compactness theorem X U Th It 
has a model. 


By the Léwenheim-Skolem theorem, £ U Th N has a countable model 
M = (| M |, O*, SN, <N, 4 M, M omy, 
Let Mo be the restriction of M to the original language: 
My = (| M |, 09, SM, <M, pm, my, 


Since M, is a model of Th N, we have M, = 9. We leave it to the 
reader to show that M, is not isomorphic to M. li 


Theories 


We define a theory to be a set of sentences closed under logical implica- 
tion. That is, T'is a theory iff T is a set of sentences such that for any sentence 
o of the language, 


TEo=oeT. 


(Note that we admit only sentences, not formulas with free variables.) 
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For example, there is always a smallest theory, consisting of the valid 
sentences of the language. At the other extreme there is the theory consist- 
ing of all the sentences of the language; it is the only unsatisfiable theory. 

For a class & of structures (for the language), define the theory of H 
(Th Z) by the equation 


Th = {o : æ is true in every member of %}. 
(This concept arose previously in the special case B= M.) 
Theorem 26D Th I is indeed a theory. 


Proof Any member of 4 is a model of Th% Thus if % is true in every 
model of Th.% then it is true in every member of 4 Whence it belongs 
to TRX. m 


For example, if the parameters of the language are V, 0, 1, +, and °, 
and F is the class of all fields, then Th & the theory of fields, is simply the 
set of all sentences of the language which are true in all fields. If @ is the . 
class of fields of characteristic 0, then Th Z is the theory of fields of 
characteristic 0, 

Recall that for a set X of sentences, we defined Mod % to be the class 
of all models of X. Th Mod» is then the set of all sentences which are 
true in all models of X. But this is just the set of all sentences logically im- 
plied by X. Call this set the set of consequences of X, Cn X. Thus 


Cn 2 = {0:2 Fo} 
= Th Mod 2. 


For example, set theory is the set of consequences of a certain set of sen- 
tences, known, unsurprisingly, as axioms for set theory. A set T of sentences 
is a theory iff T = Cn T. 

A theory T is complete iff for every sentence o, either o € T or (no) € T. 
For example, for any one structure Y, Th {WX} (written, as before, “Th WY”) 
is always a complete theory. In fact, it is clear upon reflection that The 
is a complete theory iff any two members of % are elementarily equivalent. 
And a theory T is complete iff any two models of T are elementarily equiv- 
alent, 

For example, the theory of fields is not complete, since the sentences 


14+i-+0, 
Jxxex=1 +1 
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are true in some fields but false in others. The theory of algebraically | 
closed fields of characteristic 0 is complete, but this is by no means obvious, 
(See Theorem 26G.) 


(b) A complete axiomatizable theory (in a reasonable language) is de- 
cidable. 


of sentences such that T = Cn 2. of a diagram (in which we have included the results of Exercise 5): 


Decidable ; Finitely axiomatizable 


£ % 


Definition A theory T is finitely axiomatizable iff T = Cn for some 
finite set X of sentences. 


In the latter case we have T = Cn {o} (written “T = Cn o”), where o 
is the conjunction of the finitely many members of 2. For example, the 
theory of fields is finitely axiomatizable. For the class 7 of fields is Mod Ø, 
where Ø is the finite set of field axioms. And the theory of fields is Th 
Mod © = Cn È. 

The theory of fields of characteristic 0 is axiomatizable, being Cn ®,, 
where ®, consists of the (finitely many) field axioms together with the in- 
finitely many sentences: 


Effectively enumerable <=> Axiomatizable 


For example, a theory which is given in axiomatic form (such as Zermelo- « 
Fraenkel set theory, which is Cn Azp for a certain set Azp) is effectively 
enumerable. We will argue in Section 3.6 that set theory is not decidable 
and not complete, Number theory, the theory of the structure (N, 0, S, 
<, +,-, E), is complete but is not effectively enumerable and hence not 
axiomatizable (Section 3.5). 

We can use part (b) of the preceding corollary to establish the decid- 
ability of an axiomatizable theory, provided we can show that the theory 
in question is complete. This can sometimes be done by means of the 
Łoś-Vaught test for completeness. 

For a theory T and a cardinal A, say that T is A-categorical iff all models 
of T having cardinality A are isomorphic. 


1+19¢0, 
1+1+ 15606, 


This theory is not finitely axiomatizable. To prove this, first note that no 
finite subset of ®, has the entire theory as its set of consequences. (For that 
finite subset would be true in some field of very large characteristic.) Then 
avoly the following: 


| 
*Definition A theory T is axiomatizable iff there is a decidable set 2 We can represent the relationships among these concepts by means 


| Eoś-Vaught Test (1954) Let T be a theory ina countable language such 
| that 

Theorem 266 If Cn is finitely axiomatizable, then there is a finite | 

| 


1. Tis A-categorical for some infinite cardinal A. 
Žo S Z such that Cn Xo = Cn 2. is A-categor 


2. All models of T are infinite. 


Proof Say that Cn X is finitely axiomatizable; then Cn 2’ = Cnv for Then T is complete. 


ce one sentence T. In general t ¢ ~, but at least 2 f r. (¢ € Cn v = Cn Z) Proof It suffices to show for any two models % and % of T that X = ®. 
By the compactness theorem there is a finite Zo & Z such that %F% | Since % and B are infinite, there exist (by the LST theorem) structures 
Then X =N and B' = B having cardinality A. W is isomorphic to B’, so we 


Cart S Cn, S Cn X, have 


whence equality holds. a t= Teas => 


i 


Thus Y= %8. 


i 
| 


We can now restate Corollaries 25F and 25G in the present terminology: 


(If T is a theory in a language of cardinality x, then we must demand 
that x < A.) 
The converse to the Los~Vaught test is false. That is, there are complete 
theories which are not A-categorical for any A. 


*Corollary 26F (a) An axiomatizable theory (in a reasonable language) 
is effectively enumerable. 
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In Section 3.1 we will apply the Eos-Vaught test to prove the decid. 
ability of the theory of the natural numbers with zero and successor, It 


can also be used to prove the decidability of the theory of the complex 
field. 


Theorem 26G (a) The theory of algebraically closed fields of charac- 
teristic 0 is complete. 


*(b) The theory of the complex field 


g¢ = (C, 0, 1, +, +) 
is decidable, 


Proof Let 2 be the class of algebraically closed fields of characteristic 
0. Then / = Mod(#, U I’), where ®, consists as before of the axioms for 
fields of characterisic 0, and J’ consists of the sentences 


Va Wb Wea WO Ixa: x.x +b. x+ e0), 
Va Yb Ve Yda 0> Ixa x- xextb.x.xte:xt d = 0), 


The set G U I is decidable and Th — Cn(®, U T), so this theory 
is axiomatizable. Part (a) of the theorem asserts that the theory is also 
complete, whence it is decidable. . 

Part (b) follows from part (a). For we have © € 2, whence Th.’ < Th 6, 
The completeness of Th implies that equality holds; see Exercise 1. 

To prove part (a), we apply the Łoś-Vaught test. The models of Tho” 
are exactly the members of .% These are all infinite. We further claim that 
Th.Y is categorical in any uncountable cardinality. This is equivalent to 
saying that any two algebrically closed fields of characteristic 0 having the 
same uncountable cardinality are isomorphic. 

This last assertion is a known result of algebra. We will sketch the proof, 
for the interest of those readers familiar with this topic. Any field & is 
obtainable in the following way: (1) One begins with the prime subfield, 
which is determined within isomorphism by the characteristic of &. (2) 
One takes a transcendental extension, determined within isomorphism by 
the cardinality of the transcendence basis, i.e., by the transcendence degree 
of Č (over its prime subfield), (3) One finally takes some algebraic extension. 
We thus have a theorem of Steinitz: Two algebraically closed fields are 


isomorphic iff they have the same characteristic and the same transcen- 
dence degree. 


Aaea 
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If the transcendence degree of an infinite field ¥ is A, then the cardinality 
f is the larger of A and Ny. Hence for an uncountable field, the ee 

o : + . . 9 m 

f Jude from Steinitz’s theore. 
als the transcendence degree. So we conc Ste 

a two algebraically closed fields having same characteristic and the same 
uncountable cardinality are isomorphic. a 


The theory of the real field 
(R, 0, 1, +, 2) 


is also decidable. But this result (which is due to Tarski) is much Sp 
than the above theorem. The theory of the real field is not meets in 
any infinite cardinality, so the Lo$-Vaught test cannot be applie l. T 
i As a final application, we can show that the ordering of the rationals 


elementarily equivalent to the ordering of the reals: 
(Q, <q) = (R, <n), 


i = 

where Q and R are the rationals and reals, respectively, and E and a 

aR the corresponding orderings. To show apne: sate pce 

i mplete theory (whic e 

show that both are models of some co / vane 
inci i f each structure). The key fact is provi y 
coincide with the theory of eac j T 

: countable dense linear orderings are isomorphic. 

theorem of Cantor: Any twocoun ee 

i i back up a little. The language 

To give the details, we must l 1 : 

Pers and the parameters V and <. Let ô be the conjunction of the fo 


lowing sentences: o 
1. Ordering axioms (trichotomy and transitivity): 


Wx Wy Wale <yvx~yvy <x), 
Vx Vy(x < y =y Ex) 
Wx Wy Vale <y => y <z> x z). 


2. Density: p 
Vx Wy < y= dae < z< y)). 


3. No endpoints: 
Wx 3y Iz < x <2). 


The dense linear orderings without endpoints are, = See a 
i ich ¿ dels of 6. It is clear tha 
structures for this language which ate mo foe A 
infini e im that the theory of these orderings, Cn 0, 
all infinite. Furthermore, we claim t t ; : 
is categorical in cardinality No. This is provided by the following fac 


15 - 
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Theorem 26H (Cantor) Any countable model of ô is isomorphic to 
(Q, <a): l — 


We leave the proof to Exercise 3. 
We can now apply the Łoś-Vaught test to conclude that Cn ô is complete, 
Hence any two models of 6 are elementarily equivalent; in particular, 


(Q, <o) = (R, <p). 


_ We now show by induction that every formula has an equivalent prenex 
“formula. 


1. For atomic formulas this is vacuous, as any quantifier-free formula is 
trivially a prenex formula. i ; 

2. Ife is equivalent to the prenex g’, then Vx a is equivalent to the prenex 
Vxo'. 

3, If œ is equivalent to the prenex a’, then =œ, is equivalent to sw’. 


We can also conclude that these structures have decidable theories. Apply Q1 to a’ to obtain a prenex formula; for example, 


Prenex normal form ae a ee 


It will at times be convenient to move all the quantifier symbols to the 4. Finally we come to the case of œ >f. By inductive hypothesis we 
left of other symbols. For example, have prenex formulas œ’ and p’ equivalent to œ and £, respectively. By our 
theorems on alphabetic variants, we may further assume that any variable 
Vx(Ax — Vy Bxy) which occurs quantified in one of the formulas « and p’ does not occur at 
is equivalent to all in the other. We then use Q2 and Q3 to obtain a prenex formula equiv- 
Vx Wy(Ax —> Bxy). alent to a’ > f' (and hence to a :—> p). Observe that there is some latitude » 
And in the order in which the rules Q2 and Q3 are applied. For example, 
Wx(Ax —> dy Bxy) i Vx 3y p —> Juy 
is equivalent to 
: Wx Jy(Ax — Bxy). | (where x and y.do not occur in p, u does not occur in g) is equivalent to 
any of the following: 
Define a prenex formula to be one of the form (for some n > 0) Ix Vy du(p > y), 
i : 
O4%1 sete OnXn Q, | dx su Yyy = y), 
: Ju 3x Vyp => y). 
where Q; is V or A and œ is quantifier-free. | y=) 
' Retrospectus ' 
sie DE P eni For any formula we can find a logically | r ohak dai bolic logic i 
equivalent prenex formula. t the beginning of this book it was state that symbolic logic 1s a 
i ; , | : mathematical model of deductive thought. This is as good a time as any 
Proof We will make use of the following quantifier manipulation rules. | * to reflect on that statement, in the light of the material treated thus far. 
Qla. Yxa E 4 3x ma, As a first example, consider a mathematician working in set theory. He uses 
Qib. Axe H 4Yx ma. : a language with an equality symbol, a symbol € for membership, and nu- 
Q2a. (a->Vx p) 4 Vx(« > p) for x not free in æ. merous: defined symbols (Ø, U, ete.). In principle the defined symbols could 


be eliminated and any sentence replaced by an equivalent sentence in which 
the defined symbols, did not appear. (In this connection see Section 2.7, - 
where this topic is treated systematically.) He takes as primitive (or un- 
defined) notions the notions of set and membership. He adopts some set 
Agp of axioms involving these notions. He asserts that for certain sentences 


Q2b. (œ —> 3x $) E 4 dx(a—> f) for x not free in æ. 
Q3a. (Vxa—- 8) ES īx(æ =f) for x not free in £. 
Q3b. (xa -> p) H 4 Vxl — f) for x not free in $. 


Q1 is clear; for the others see pages 113f. and Exercise 8 of Section 2.4. 
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(his theorems), these sentences are true provided the axioms are true, re- 
gardless of what the undefined notions of set and membership actually 
mean. In support of these assertions he offers proofs, which are finitely 
long arguments intended to convince his colleagues of the correctness of 
the assertions. 

In terms of first-order logic we can describe all this as follows: The 
language here is a first-order language with equality and a two-place predicate 
symbol €. Thus Y and e are the only parameters open to interpretation, 
There is a certain set Ag of sentences in this language singled out as being 
the set of (nonlogical) axioms. Then certain other sentences are logical 
consequences of Ag, i.e., are true in any model of Asr. Ifv is a conse- 
quence of Agr (and only then), there is a deduction of t from Agr. 

Next consider a more typical case of the hypothetical working mathe- 
matician, that of the algebraist or analyst. The algebraist uses axioms for 
(say) group theory, but he also employs some amount of set theory, Similarly, 
the analyst deals with sentences that involve both numbers and sets of 
numbers. In both cases it is generally recognized that one could, in prin- 
ciple, convert the assertions of algebra and analysis to assertions of set 
theory. And then the remarks of the preceding paragraph again apply. 

The interest that symbolic logic holds for the mathematician is largely 
due to the accuracy with which it mirrors mathematical deductions. In 
the long run, it will surely be useful to understand the fundamental processes 
of doing mathematics. 


There remains the question of the accuracy with which first-order logic © 


mirrors nonmathematical deductive thought. Logic, symbolic and non- 


symbolic, has always formed a traditional part of the philosophical study ` 


of the process by which men come to hold certain ideas. Nonmathematical 
examples to which first-order logic applies are provided by a vast array 
of frivolous situations. Lewis Carroll gave such examples, one of which 
inferred that babies cannot manage crocodiles from the three hypotheses: 
(1) Babies are illogical. (2) Nobody is despised who can manage a crocodile. 
(3) Hlogical persons are despised. 

But what of nonfrivolous situations? Here the applicability is obscured 
by the fact that. we usually do not make explicit the assumptions we use in 
drawing conclusions. There are specific areas (in diverse fields such as 
physics, medicine, and law) where assumptions not only can be made ex- 
plicit but are being made explicit. In some cases it appears that less than 
the full versatility of first-order logic is required to formalize the real-life 
deductions. In the case of quantum mechanics, more features may be 
necessary. 
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EXERCISES 


1. Let T, and T, be theories such that Gi) T, S T,, (ii) T, is complete, 
and (iii) T, is satisfiable. Show that T) = T). 


2, Establish the following facts: 
(a) 5, S X, => Mod X, © Mod ay 
H EA, [=> ThA, S Th. 

(b) 2 = Th Mod Z and % < Mod Th”. 

(c) Mod Z = Mod Th Mod X and Th 4 = Th Mod Th %. (Part (c) 
follows from (a) and (b).) 

3, Prove that any two countable dense linear orderings without endpoints 
are isomorphic (Theorem 26H). Suggestions: Let Mf and B be such structures 
with |N | = {a,a,...} and | B| = {bo, by, --.}- Construct an isomor- 
phism in stages; at stage 2n be sure a, is paired with some suitable b;, 
and at stage 2n + 1 be sure bẹ is paired with some suitable a;. 


4, Find prenex formulas equivalent to _ 


(a) (Ax Ax a Sx Bx) - Cx. 
(b) Wx Ax <> dx Bx. 


*5. Prove the converse to part (a) of Corollary 26F: An effectively enu- 
merable theory (in a reasonable language) is axiomatizable. 


6. Consider a language with a two-place predicate symbol <, and let 


= (N, <) be the structure consisting of the natural numbers with their 


usual ordering. Show that there is some Y elementarily equivalent to N 
such that <% has a descending chain. (That is, there must be do, ais 
in || such that aj, a € <% for all i.) 


7. Show that an infinite map can be colored with four colors iff every 
finite submap of it can be. Suggestion: Take a language having a constant 
symbol for each country and having four one-place predicate symbols 
for the colors. Use the compactness theorem. 


8. Let 2, and Z, be sets of sentences such that nothing is a model of 
both X; and X,. Show that there is a sentence 7 such that Mod 2, © Mod t 
and Mod X, & Mod ~r. (This can be stated: Disjoint EC, classes can 


be separated by an EC class.) l 

9, Assume that o is true in all infinite models of a theory T. Show that 
there is a finite number k such that ø is true in all models W of T for which 
|A| has k or more elements. 
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§2.7 INTERPRETATIONS BETWEEN THEORIES! 


In some cases a theory 7, can be shown to be every bit as powerful 
as another theory Tọ. This is certainly the case if the theories are in the 
same language and To S 7,. But even if the theories are in different lan- 
guages, there may exist a way of translating from one language to the 
other in such a way that members of 7) are translated as members of T,. 
This sort of situation will be examined in this section. 

We will begin by discussing the topic of defined symbols. This topic, 
as well as having significant interest of its own, will serve as an example 
for the situation of the preceding paragraph, wherein 7, is constructed 
from T, by adding a new defined symbol. If the definition is done properly, 
the original theory 7; should in principle be just as strong as the new To. 
We will consider only the case of defined function symbols, since the case - 
of defined predicate symbols presents, in comparison, no real difficulties. 


- Defining functions - 


Frequently in mathematics it is useful to introduce defnivions of new 
functions. For example, in set theory one defines the power-set operation 
f by a sentence like, “Let Px be the set whose members are the subsets 
of x.” Or by a sentence in a formal language (here containing €, S, and P’), 


Vo VF, = v, <> Vulfu E v > u S n). 


Now definitions are unlike theorems and unlike axioms. Unlike theo- 
rems, definitions are not things we prove. We just declare them by fiat. But 
unlike axioms, we do not expect definitions to add substantive information. 
A definition is expected to add to our convenience, not to our knowledge. 

If this expectation is to be realized, the definition must be made in a , 
reasonable way. As an example of a most unreasonable definition in number, 
theory, suppose that we introduce a new function symbol f by the “defini- 
tion” 

S@M=y iff x <y. 
(Or by the sentence in a formal language: V0, Vol fry = Vy +> V < 09).) 
Since we know that 1 < 2, we see that /(1) = 2. But also 1 <3, so we 
obtain f(1) = 3. And so we come to the conclusion (ynich does not it- 
self involve f) that 2 = 3. 


! The results of this section will be used only in Section 3.6. 


language, if 


| 
| 
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Obviously this definition of f was in some way very bad. It did not just 
make matters convenient for us; it enabled us to conclude that 2 = 3, 
which we could not have done without the definition. The trouble was that 


‘the definition bestowed the name “f(1)’ ambiguously upon many things 


(2 and 3 among them), Thus /(1) was not “well defined.” Names ought to 
designate unique objects. 

In this subsection we want to consider conditions under which we can. 
be assured that a definition will be satisfactory. To simplify the notation, 
we will consider only the definition of a one-place function symbol f, but 
the remarks will apply to n-place function symbols as well. 

Consider then a theory T in a language not yet containing the one-place 
function symbol f. (For example, T might be the set of consequences of 
your favorite axioms for set theory.) We want to add f to the language, 


‘introducing it by the definition 


(6) Vo, Vol fo, ~ 


Vo <> pl, 


where o is a formula in the original language (i.e., a formula not con- 
taining f) in which only v», and v, may occur free. 


Theorem 27A In the above situation, the following are equivalent: 


(a) (The definition is noncreative.) For any sentence o in the smaller 
T;dEo 


(in the augmented language), then already TF o. 
(b) (f is well defined.) The sentence 


(e) Vv, Aln p 


is in the theory T. (Here “Alv, y” is an abbreviation for a longer formula; 
see Exercise 21 of Section 2.2.) 


Proof To obtain’(a) => (b), simply note that ô } e. So by taking o = € 
in part (a), we obtain T Ẹ e. 

Conversely, assume that T | £. Let be a model of T. (Ñ is a structure 
for the original language.) For de | %|, let F(a) be the unique e e |% | 
Such that Hy o [d, e]. (There is a unique. such e because Fy e.) Let X, F) 
be the structure for the augmented language which agrees with Ù on the 


Original parameters and which assigns F to the symbol f. Then it is easy 


to see that (X, F) is a model of ô. Furthermore, Ù and (Ul, F) satisfy the same 
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sentences of the original language. In particular (UW, F) is a model of T, 
Hence 


T; ò F o> Fa, mo 


= Fao. E 


(This argument can be stated more briefly by using second-order logic, 
e is logically equivalent to the sentence Jf ô.) 


Interpretations 


The basic idea is that it is possible for one theory to be just as strong 
(in a sense to be made precise) as another theory in another language. In 
considering two languages simultaneously, there is no point in allowing 
themi to conflict; e.g., the negation symbol of one language should not bea} 
predicate symbol in the other. We can eliminate such conflicts by assuming 
that each of the languages is obtained from a third parent Janguage by 
deleting some parameters (and perhaps equality). 

For example, axiomatic set theory is at least as strong as the theory 
of the natural numbers with zero and successor, i.e., the theory of (N, 
0, S). Any sentence in the language of (N, 0, S) can be translated in á 
natural way into a sentence of set theory. (This translation is sketched 
in Section 3.6.) If the original sentence was true in (N, 0, S), then the transla- 
tion will be a consequence of the axioms of set theory. (This is not obvious; 
the proof uses facts to be developed in Section 3.1.) 

Let us look more. carefully at a second example. Consider on the. one 
hand the theory of | 


(N, 0, 5S) 
in its language, and on the other hand the theory of 
(Z, +, 7 


in its language. (Here Z is the set of all integers, positive, negative, and 
zero.) We will shortly be in a position to claim’ that the second theory } 
as strong as the first. How might a sentence about the natural numbers 
with 0 and S be translated into a sentence about the integers Z with ad- 
dition and multiplication? 
The first clue is that an integer is nonnegative iff it is the sum of fou | 
squares, so a quantifier Vx in the first language (where x is intended to range | 


| 
| 
| 
| 
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over N) can be replaced by 


Wx(4y, Iy Iys Iya X = Va Yı F Ye Ya F Ys ° Ya F Yat Ya 


in the second language. 
The second clue is that {0} and the successor function (viewed as a 
relation) are definable in (Z, +, +). The set {0} is defined by 


v + A == Y.. 
The successor relation (extended to Z) is defined by 
Walz ez ZAZ4+ 2A Z= v + Z= v). 
Thus the sentence about (N, 0, 5) 


Vx Sx a 0 


can be translated into 


Wxldys Iya Iya Iya x = Ya Yı F Ye Ya F Ya * Ys F Vat Ya 
aVulu -+ u = u > Yo(Wz(z z= ZAZ + ZH ZX A z a v) 


v = u))]. 


So much for examples. For our general discussion it will -be helpful to 
introduce the notation 
plt) = Pi; 
Ptr» to) = (i ta 


and so forth. Thus p = plv) = pi, v2). If we use “p(x)” we will not 
worry too much about whether or not x is substitutable for v in g. If it 
is not, then we actually want p(x) to be py, where y is a suitable eens 
variant of g. 

Assume now that we have the following general situation: 

L is a language. (A language can for all practical purposes be a set of 
parameters, possibly augmented by the equality symbol.) 

T, is a theory in a (possibly different) TONERE L,, which includes 
equality. ; 


Definition An interpretation x of Ly into T, is a function on the set of 
parameters of Lo such that 
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l. x assigns to V a formula sty of L, in which at most vı occurs free, such 


that 


(i) (ae ae 


(The idea is that in any model of 7,, the formula zy Should define a 


nonempty set to be used as the universe of an L-structure.) 


2, % assigns to each n-place predicate symbol P a formula tp of L 


in which at most the variables v,,..., Va occur free. 


3. æ assigns to each n-place function symbol fa formula a, of L, in 


which at most 1, ...,0,, %,4, occur free, such that 


(ii) M E Woy ++ Wan (ay(v1) > ++ => tylo) 


=> Ax(sty(x) A VOGO, -o y Papa) 4 Yaa. = X))). 


(In English, this formula becomes, “For all V in the set defined by ay, 


there is a unique x such that nV, x), and furthermore x is in the set defined : 


by sy.” The idea is to ensure that in any model of T,, x, defines a function 
on the universe defined by my. In the case of a constant symbol c, we 
have n = 0 and (ii) becomes 


T, F Ax(sty(x) a Voil) +> vy = x)). 


In other words, x, defines a singleton whose one member is also in the set 
defined by xy.) 


For example, if Ly is the language of (N, 0, S) and 7; is the theory of 
(Z, +, +), then we have 


myl) = Ayy Aye Iya Iya x = yy e Pa F Va Pa F Va e Yen Vay 
Teg(X) = x + x = x, À ” 


as(x, y) = Vy(z °ZeZAZ+ zz >x+72~y). 
(We are here exploiting the fact that in (Z, +, +) we can 
the structure (N, 0, 5).) 


If Lo coincides with L,, there is trivially the identity interpretation 2, 
for which 


, in effect, define 


Ty = N = Vi, 
np = Pri tt dp, 


ny = fv, ee Va © Vapi: 


The conditions (i) and (ii) are then met no matter what T, is. 
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Now assume that z is an interpretation and let 8 be a model of 7;. 


There is a natural way to extract from % a structure 7% for Ly. Namely, 
let 


| 7B | = the set defined in B by zy, 
p*® = the relation defined in 8 by xp, restricted to | “8 |, 
fa, ..+ 4) = the unique b such that Fig (a1, -> an b], 
where @,,...,@, are in |78]. 


By condition (i) in the definition of interpretations, |78 | + ©. And, 7 
condition Gi), the definition of f “8 makes sense; i.e., there is a unique 


meeting the above condition. Hence "%9 is indeed a structure for the lan- 


uage Lo. d . , 
. Define the set 2 1[7,] of Lo-sentences by the equation 


aiT] = Th{*B : 8 e Mod Ty} 


. n rE 
= {ø :ø is an Lo-sentence true in every structure B obtainable 


from a model B of T4}. 


This is a theory, as is Th .% for any class 2. It is a satisfiable theory iff 
T, is satisfiable. 


Exampte Earlier in this section we had a theory T containing the 
sentence ; 


(e) Wo, J lva p. 


We augmented the language to a larger language L+ which -n a 
function symbol f. The “definition” of f was provided by the Lt-sentence 


(ô) Vo Vnt = 2 +> p). 


We showed that for a sentence o in the original language of T, if T; ô Eo, 
then T E ø. . ' l l re 
We a an interpretation æ from L* into T. x 1s the identity interpreta 
tion on all parameters except f. The formula 7p is p. The fact that THe 
is just what we need to verify that s is indeed an interpretation. For any 
model X of T, “Ù is a structure previously called (X, F); it is a model of 


26Ts8. 


We claim that 
wT] = Cn(F ; ô). 


hi 
i 
i 
it 
| 
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First observe that any model % of T ; 6 equals "Y, where % is the restriction 
of B to the language of T. Hence for an L*-sentence o, 


cen YT] Fayo for every model Ñ. of T 
<>Fgo for every model % of T; 6 
>T; ô Ho. 


Syntactical translation 


In the preceding subsection on interpretations we talked about arbitrary 
models and such. But the reader may already have noticed that there is a 
much more down-to-earth thing to be said about an interpretation x of Ly 
into T,. Briefly: We can, given a formula pof Lo, find a formula g” in L 
which in some sense corresponds exactly to y. We define g” by recursion 
on 9. 

First, consider an atomic formula œ of Ly. For example, if a is 


Pex, 
then æ is logically equivalent to 
Vy(gx = y => W2( fy = z => Pz)). 
And we can take for o” the L -formula 
Vy (g(x, y) > Vzor, 2) > ep(z))). 


In general, scan an atomic formula « from right to left. The rightmost 
place at which a function symbol occurs will initiate a segment of the form 
8X, +++ Xp for an n-place g. (In the example n = 1.) Replace this by some 
new variable y, and prefix VYA ..., Xas y)—>. Continue to the next 
place at which a function symbol occurs. Finally, replace the predicate sym- 
bol P (if a parameter) by xp (with the correct variables). 

The definition of œ” can be stated more carefully by using recursion on 
the number of places at which function symbols occur in a. If that number 
is zero, then œ is Px, +++ x, and o” is Tpx, -.., Xa) Otherwise, take the 
rightmost place at which a function symbol g occurs. If g is an n-place 


symbol, then that place initiates a segment 8X1 tt Xp. Replace this seg 


ment by some new variable y, obtaining a formula we can call agen, Then 
a is 


Vy (tg, -es Xps Y) —> (agti tny), 
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_ For example, 


(Pfa) = Y yll, y) > (PHY) 
= Vye y) > W2(ai,(y, 2) > (P29) 
= Vy(oufx, y) > W2leuj(y, 2) > mp2). 


The interpretation of a nonatomic formula is defined in the obvious 


way. (~p)" is (mg”), (p — y)" is (p —> y”), and (Y xoy is Yx(ry(x) = g”). 


(Thus the quantifiers are “relativized” to ay.) i 
The sense in which g” “says the same thing” as ọ is made precise in the 


following basic lemma. 


Lemma 27B Let x be an interpretation of & into T}, let B be a ione 
of T,. For any formula of Lọ and any map s of the variables into | "8 |, 


key P is] if Fag” [s]. 
This is not a deep fact. It just says that y* was defined correctly. 


Proof We use-induction on g, but only the case of an atomic formula 
æ is nontrivial. For a, we use induction on the number of places at which 
function symbols occur. It is easy if that number is zero. Otherwise, 


o” =V yla (x, y) > p”), 


where BY, = œ. (We have quietly assumed that g is a one-place symbol; 
the notation is bad enough already.) Let 


b = the unique b such that Ey atg Is), b] 


= g"¥(s(x)). 
Then 
aor [s] <> Ea 8" (910) i 
<> Fag B solb) by the inductive hypothesis 
<> Ex e Bi, Is] by the substitution lemma 


<> Ery g [s]. E 
The following corollary justifies, our choice of notation for #7 "[T;]. 


Corollary 27C For a sentence o of Lo, 


cen NT] iff o ET, 
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Proof Recall that by definition 


o € nT] < for every model B of -T,, Fags 
<> for every model B of T}, Hyo” by Lemma 27B 
<> T, F o. E 
Definition An interpretation x of a theory T, into a theory T; is an in- | 
terpretation æ of the language of Ty into T, such that 


To € wT]. 
In other words, it is necessary that for an Ly-sentence o, 
o E To => 0E T. 


x—'[T,] is the largest theory which x interprets into T,. If T 9 = a h 
then we have 


o € To 0" E€ T). 


To return to an earlier example, consider the structures (N, 0, S) and 
(Z, +, +). We had an interpretation x into Th (Z, +, -), where 


nal) = Ay, Ay, Iys Iya x Vie Ya F Yee Va F Ye Ya F Va Vas 
mo) = x 4- x= x, 
mg, y) = Vz(z ez =z Az + ZA z= x 4 zæ y), 


We now claim that x is a faithful interpretation of Th (N, 0, S) into Th (Z, 
+, +). For in this case, (Z, +, -) is the structure (N, 0, S). Hence 


Fw,0,s)9 > EG <> Fz, +, 07, 


In Chapter 3 we will be able to show that there is no interpretation of Th(Z, 
+, +) into Th(N, 0, S). Thus the former theory is strictly stronger than’ the 


In this case æ is said to be a faithful interpretation of T, into T}. 
| 
| 
| 

latter. | 


Finally, let us return to the situation with which we started this section. 
Assume that T is a theory containing the sentence e, where 


e = Vv, dle, o; 
6 = Yn Yoli ~ n + 9); 
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L* = the language obtained by adding the new function symbol f to 
the language of 7; 


z == the interpretation of L+ into T which is the identity interpreta- 
tion on all parameters except f, and Tig = p.. 


In fact, a is a faithful interpretation of Ca(T ; 6) into T, since, as noted 


© previously, 


a [T] = Cul ; ô). 
We can now draw an additional conclusion; the definition is eliminable. 


Theorem 27D Assume that we have the situation described above. 
Then for any L*-sentence ø we can find the sentence o” in the original 
language such that 

(a) T; ô E (0 <= 0%). 

(b) T; ô ko <T Eo. 

(c) If f does not occur in e, then E (e <» 0), 


Proof Part (c) follows from the fact that x is the identity interpretation 
on all parameters except f. Part (b) restates that x is a faithful interpretation 
of Cn(T ; ô) into T. Since x is faithful, for (a) it suffices to show that 


T F (0 +> o7)*. 


This follows from (c), since (ø <> 0”) is (o” <> o7”), which is valid. @ 


EXERCISES 


1, Assume that L, and L, are languages with the same parameters except 
that Ly has an n-place function symbol f not in Z, and ZL, has an (n + 1)- 
Place predicate symbol P not in Ly. Show that for any Lo-theory T there is 
a faithful interpretation of T into some Ly-theory. 


2. Let Lo be the language with equality and the two-place function sym- 
bols + and -. Let L, be the same, but with three-place predicate symbols 
for addition and multiplication. Let 9; == (N +, +) be the structure for 
L, consisting of the natural numbers with addition and multiplication 


| ds 0, 1). Show that any relation definable by an Lo-formula in Ng is also 


definable by an L,-formula in 9. 


» Show that an interpretation of a complete theory into a satisfiable 
cory is faithful. 
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§2.8 NONSTANDARD ANALYSIS‘ 


The differential and integral calculus was originally described by Leibniz 
and Newton in the seventeenth century in terms of quantities which were 
infinitely small yet nonzero. Newton used in his-calculations a number o 
which, being infinitely small, could be multiplied by any finite number and 
still be negligible. But it was necessary to divide by o, so it had to be nonzero. 
Leibniz’s dx was less than any assignable quantity, yet was nonzero, 

These ideas were not easy to comprehend or to accept. Throughout 
the eighteenth century this business of working with infinitesimals was 
attacked (e.g., by Bishop Berkeley), distrusted (e.g.,' by D’Alembert), and 
used in enthusiastic experimentation (e.g., by Euler). While Euler was creat- 
ing the mathematics which students now study in advanced calculus, he 
used infinitesimals in a loose, free-swinging manner that would not be 
tolerated in today’s freshmen. Only in the nineteenth century were the foun- 
dations of calculus presented in the form now found in textbooks. The 
treatment of limits was then rigorous, and debate subsided. 

In 1961 Abraham Robinson introduced a new method for treating limits, 
rescuing infinitesimals from, their intellectual disrepute. This method com- 
bines the intuitive advantages of working with infinitely small quantities 
with modern standards of rigor. The basic idea is to utilize a nonstandard 
model of the theory of the real numbers. 


Construction of *3t 


We will use a very large first-order language. In addition to ‘symbols 
for +, -, and < we might as well add symbols for the exponentiation and 
absolute-value functions. And since there is no good reason to stop there, 
we go all the way and include a symbol for every operation on the set R 
of reals: We do the same for every relation on R. Thus we have the language 
with equality and the following parameters: 


0. Y, intended to mean “for all real numbers.” 

1. An n-place predicate symbol Pz for each n-ary relation R on R. 
2. A constant symbol e, for each re R. 

3. An n-place function symbol fp for each n-ary operation F on R: 


For this language there is the standard structure R, with | f | = A 
P3 = R, c* = r, and ff = F. But now let us form a nonstandard structure, 


1 This section may be omitted without loss of continuity. 


| 
| 
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by using the compactness theorem. Let I’ be the set 
Th*t U {e,P.v,: 7 © R}. 


(Here c, P< v, formalizes “r is less than v,.”) Any finite subset of I” can be 
satisfied in R by assigning to v, some large real number. Hence by the 


' compactness theorem there is a structure Ñ and’an element a € |X | such 


that I is satisfied in X when », is assigned a. Since Ù is a model of Th 8, 
we have { =R. There is also an isomorphism h of St into (but not onto) 
A defined by 

hr) = ec. 


To check that this is indeed an isomorphism, we use the fact that WHR 
h is one-to-one, since for r, Æ Fg, the sentence ¢,, AY Cr, holds in R and 
hence in X. A preserves a binary relation R(= P®) since for any r and s 
in R, 
<r, 8) € Ph on FaPrests 
<> Fu RE, 
<> Ce, c% e PY 


<> Ch(r), h(s)> € PX. 
A similar calculation applies to any n-ary relation. Next we must show that 
h preserves any function F(= f R), Again for notational ease, suppose that 
F is a binary operation. Consider any r and s in R, and -let ¢ = F(r, s). 
Then 
ACFE, sS) = AGW, s) 
= h(t) 


=F; 
Now the sentence c, ~ Jfre,e, holds in R and hence in Y. Hence 
oft = fC D 
= FRAC), As). 
So A preserves fy. For the constant symbols we have by the definition of h, 


Ate?) = hlr) 


= c, 
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Properties of R which cannot be expressed in the language are likely 
| to fail in *R. The least-upper-bound property is one such. There are non- 
empty bounded subsets S of *R which have no least upper bound (with 
respect to the ordering *<). For example, R is such a subset of *R. It 
is bounded by the infinite b of the preceding paragraph. But it has no least 
upper bound; see Exercise 7. 

Define the set # of finite elements by the equation 


Since we have an isomorphic copy of § inside MW, we can find another _ 
structure *3t isomorphic to UW such that K is a substructure of *St. The idea 
is simply to replace in Ñ the point c% by the point r (provided that [Wl A 
R = ©, as can always be arranged). For details, see Exercise 23 of Section 
2.2. Since * is isomorphic to Y, there is a point b e | *St| such that *9 
satisfies I’ when », is assigned b. In particular, *R = MR. 

To go much further, we need a less cumbersome notation. We will use 


an asterisk to indicate passage from # to “3. 
; i aai = {xe *R:*| x| *< y for some ye R}. 
1. For each n-ary relation R on R, let *R be the relation Pt assigned ' 


to the symbol Pr by *. In particular, R is a unary relation on R. Its 
image *R equals the universe of *§t, since the sentence Vx Prx is true in R 
and hence in *St. Since R is a substructure of “St, we have that each relation 
R equals the restriction of *R to R 

2. For each n-ary operation F on R, let *F be the operation f°}? assigned 
to the symbol fp by *R. F is then the restriction to R of *F, 


Similarly, define the set Z of infinitesimals by the equation 
J = {xe *R:*|x|*<y for all positive y € R}. 


If A € R is unbounded, then *A contains infinite points. For the sen- 
tence “for any real r there is an element a € A larger than r” is true and 
formalizable. Take some infinite positive b; there must be a larger (and 
hence infinite) member of “A. For example, *N contains infinite numbers. 
‘The only standard infinitesimal, i.e., the only member of R MY, 
is 0. But there are other infinitesimals. For by the usual (formalizable) rules 
for inequalities, the reciprocal of any infinite number is an infinitesimal. 


Observe that ¢; = r, so we need no special notation for this. 

There is a general method (to be used heavily in the remainder of this 
section) for demonstrating. properties of a relation *R or an operation *F. 
One simply observes (1) that R or F has the property, (2) that the property 
can be expressed by a sentence of the language, and (3) that N = 

For example, the binary relation *< on *R is transitive. This is because s 3 
< is transitive, and this property can be expressed by the sentence l Algebraic properties 


Vx Vy W2(xP<y => yP< z -> xP. 2). which will be useful later. 
By similar reasoning, *< satisfies trichotomy on *R and thus is an order- 
ing relation on *<, 

For another example, we can prove that the binary operation *4- on 
*R is commutative, since -+ is commutative and the commutative law 
can be expressed by a sentence. By applying this reasoning to each of the 
field axioms, we see that (*R, 0, 1, “+, *-) is a field. 

This general method is used so much, we will shortly begin to take it 
for granted. If, for example, we assert that *| a*+ b| *<*|a| *+ *|b| 
for a and b in *R, we will take for granted that the reader perceives that 
the general method yields this fact. : 

We have R S *R, but R>4*R. For we have a point b such that — 
Fon ¢,P <0, [b]; ie, r*<b. Thus b is infinitely large, being larger (in 
the ordering *<) than any standard r, i.e., any r € R. Its reciprocal 1 */b 
will be a sample infinitesimal. 


Theorem 28A (a) F is closed under addition me subtraction *— 
and multiplication *-. 

(b) Z is closed under addition *+, subtraction *—, and multiplication 
from F: 

xe? and yePWo>x* yeg. 


In algebraic terminology, part (a) says that 7 is a subring of the field 
*R, and part (b) says that 7 is an ideal in the ring F. We will see a little 
later what the quotient ring F LF is. 


Proof (a) Let x and y be finite, so, that * rx aa, *|y|*<b for 
Standard a and b in R. Then 


*|x*ty|*s*|x|*+ *ly|*<a+beR, 


In the next theorem we collect some algebraic facts about @ and Z - 
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whence x*+ y, x *— y are finite. Also For standard r and s, we have r œ s iff r = s, as 0 is the only standard 
TE E E k infinitesimal. 
. Lemma 28C If xÆ y and at least one is finite, then there is a standard 


k, i i i 
whence x*- y is finite, too. q strictly between x and y. 


(b) Let x and y be infinitesimals. Then for any positive standard a, 
*| x | *<a/2 and *| y| *< a/2. Hence Proof We may suppose that x *< y. In fact, we may further suppose 
thatO *< x *< y; the case x *< y *< 0 is similar and the case x *< 0 * < y 
*l x *®+ y|*<a/2 + a/2 =a, 
Since x is finite we have x *< mb for some positive integer m; take the least 
such m. Then x *< mb*< y. (By the leastness of m, (m —. 1)b *< x. 


So mb *<x*¥+ b*< y.) a 


so that x *-+ y and x *— y are infinitesimal. If z is finite, then *| z | *< b 
for some standard b. Since x is infinitesimal, we have *| x | *< a/b, whence 


kfk, g| = PETS 
[x* z| *< Gf b)b a Theorem 28D Every xe @ is infinitely close to a unique r € R. 


Thus x *- z is also infinitesimal. C] 


Proof For'x 6 @, the set 
Definition x is infinitely close to y(x œ y) iff x *— y is infinitesimal. | S={yeR:y*<x} 


Theorem 288 (a) œ is an equivalence relation on *R. 
(b) If uœ v and x œ y, then u*+ x œv *+4 y and *— u œ *— y, 
(c) If uœ v and x œ y and x, y, u, v are finite, then u *: x œp *. y. 


of standard points below x has an upper bound in R. Let r be its least 
upper bound; we claim that x œr. 

If x 9¢ r, then by the lemma there is a standard q between x and r. If 
r<q*< x, then r fails to be an upper bound for S. If x *< q <r, then 
q is also an upper bound for S, contradicting the leastness of r. Hence 
xr, 

. This establishes the existence of r. As for the uniqueness, note that if 
xcxr and x cs, then r œs. For standard r and s this implies that 
r= 3, 


Proof This is a consequence of part (b) of the preceding theorem (7 
is an ideal in F). 
(a) œ is reflexive since 0 is infinitesimal. œ is symmetric since the nega- 


tive (*—) of an infinitesimal is infinitesimal. Finally, suppose that x c= 
and y cv z. Then 


xt z=(x*-y)"+4-DEF 


Corollary 28E Each finite x has a unique decomposition x = 5*+ i, 


ae where s is standard and i is infinitesimal. 
since {7 is closed under addition. . 


(b) If u œv and x œ y, then 
Utt xt o*t = už o) NEF 


We call s the standard part of x, st(x). Of course for standard r, st(r) = r. 
In the next theorem we summarize some properties of the st function. 


. Theorem 28F (a) st maps F onto R.” 
since is closed under addition. Also *—u œ *—y since J is closed under (b) st(x) = 0 iff x is infinitesimal. 
negation. (c) st(x *-+ y) = st(x) + st(y). 
(0) (u a*t y) = (u DAU DEE GH y) oy) SOE ERT E | 
=u* (x*— y)*4 (u*— v)* yeZ | Proof (a) and (b) are clear. Since st(x) = x and st(y) = y, we have by 
part (b) of Theorem 28B st(x) + st(y) ~= x *4- y. Hence the left side equals 
st(x *+ y). Part (d) is similar and uses part (c) of Theorem 28B. a 


since Z is closed under multiplication from F. 


is trivial. Since x g y there is a standard b such that 0 < b *< y *— x. 


isa ô > 0 such that 
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< Second remark: It is not really necessary to have dom F = R. It is 
enough for a to be an accumulation point of dom F. (a is an accumulation 
point of S iff a is infinitely close to, but different from, some member -of 


*S.) 


(In algebraic terminology, this theorem asserts that st is a homomorphism 
of the ring F onto the field R, with kernel Z. Consequently, the quotient 
ring FJZ is isomorphic to the real field R.) 

Henceforth in this section we will streamline our notation by omitting 
the asterisks on the symbols for the arithmetic operations *+, t, ; ; 
and */ ý r z : Corollary 28G F is continuous at a iff whenever x œa, then *F(x) 


œ F(a). 


Convergence Now: consider a function F :R— R and a standard a € R. Then the 


i ; i derivative F”(a) is 
In calculus courses convergence is usually treated in terms of «’s and 6's @) 


and variables which come delicately close to certain values. We will give 
here the beginning of an alternative treatment of convergence, where vari- 
ables come infinitely close to the limiting values. 


lim 
ha 


F(a + h) — F(a) 
ay E 


By our definition of limit, this can also be stated: F'(a) = b iff for every 
nonzero infinitesimal dx we have dF/dx æ b, where dF = *F(a + dx) 


Definition Let F:R— R. Then F converges at a to b iff whenever — F(a). Thus if there is such a b (i.e., if F'(a) exists), then 


x is infinitely close to (but different from) a, then *#(x) is infinitely close 


to b. F'(a) = st(dF/dx) 


Proof of equivalence with the ordinary definition: First suppose that 
F converges at a to b in the ordinary sense. That is, for any e > 0 there 


for any nonzero infinitesimal dx. Here dF/dx is the result of dividing dF by 
dx. The fact that we just use division here greatly facilitates calculations. 


EXAMPLE Let F(x) = x?. Then F'(a) = 2a, since 


: 
| 
| 
| 
| 


Oxz\|x-a|<d=|b-— FX)|<e for any x. 
dF E Ane eae ey) Se = 2A U + Cone 2a + dx ~œ 2a. 


The displayed sentence (concerning the standard numbers s and ô) is Cage dx 

formalizable and thus holds in *t. Now if x in *R is infinitely close 

to (but different from) a, then certainly 0 *lx—al|*< ô. Hence Theorem 28H If F(a) exists, then F is continuous at a. 
*| b = *F(x) | "< e. Since e was arbitrary, b œ * F(x), 


Conversely, assume that the condition stated in the definition is met. Proof For any nonzero infinitesimal dx we have 


Then for any standard e > 0, the sentence “F(a + dx) — F(a) 
a  F'(a), 


There exists 6 > 0 such that for all x, 0 4|a—x| < ô = | b — F(x) | <e dx 


The right side is standard, so the left side is at least finite. Consequently, 
when we multiply the left side by the infinitesimal dx we are left with the 
fact that *F(a + dx) — Fla) € Z; ien *F@ + dx) = F(a). a 


sila aaa iaaa aa 


(when formalized) holds in *$t, since we can take ô to be infinitesimal. 
Hence the sentence holds in R also. E i 


First remark: It is entirely possible that F does not converge at a to any 
number. On the other hand, F converges at a to at most one b. For if i 
is a nonzero infinitesimal, then b = st(*F(e + i)). It is traditional to denote 
this b by “lim, .,,/(x).” Thus : 


The reader should note that this result is not some nonstandard analog 
of a classical theorem. Nor even a generalization of a classical theorem. 
It is a classical theorem. It is only the proof which is nonstandard, The 
ame remarks apply to the next theorem. Let Fo G be the function whose 


Jim F(x) = tF + i). alue at a is F(G(a)). 
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Chain Rule Assume that G'(a) and F'(G(a)) exist. Then (Fo G)'(g) 
exists and equals F’(G(a)) - G'(a). 


Proof First, notice that “(Fo G) = *Fo*G, since the sentence Yn, 
Jfroatı = fefavı holds in the structures. Now consider. any nonzero infini- 
tesimal dx. Let 


dG = *G(a + dx) — 
dF 


G@), 

*(F o G)(a + dx) — (F ° GXQ) 
= *F(*G(@ + dx)) — F(G@)) 
*F(G(a) + dG) — F(G(a)). 


Then dG œ 0 since G is continuous at a. If dG 0, then by the last of these 
equations dF/dG œ F'(G(a)), whence 


=e a ~ rew). G'(a). 


If dG = 0, then dF = 0 and G'(a) = dG/dx = 0, so we again have 


AE 2. FNA): O. 

These theorems are but samples of the treatment of convergence in terms 
of infinite proximity. The method is not at all limited to elementary topics. 
One can construct delta functions 6 with the property that f% 6 =1 
and yet d(x) ~ 0 for x Æ 0. Original results in analysis (e.g., in the theory 
of Hilbert spaces) have been obtained by the method of nonstandard analysis. 
Possibly the method will be more widely used in the future as more analysts 
become familiar with it. 


EXERCISES 


1. (Q is dense in R.) Let Q be the set of rational numbers. Show that 
every member of *R is infinitely close to some member of *Q. 


2. (a) Let A S R and F: A — R. Then Fis also a binary relation ot 
R; show that *F : *A — *R. i 

(b) Let S: N — R. Recall that S is said to converge to b iff for every 
e > 0 there is some k such that for all n > k, | Sa) — b | < e. Show that 
this is equivalent to: *S(x) œ b for every infinite x € *N. 


(c) Assume that S;: N—> R and S; converges to b; for i = 1, 2. Show 


that S, + S, converges to b, + b, and S, - S, converges to b, - ba. 


a 


OSOA 


Mines 
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3. Let F: A — R be one-to-one, where 4 ¢ R. Show that if x6 *4 
but x ¢ A, then *F(x) ¢ R. 


4. Let A S R. Show that A = *A iff A is finite. 


5, (Bolzano—Weierstrass theorem) Let A © R be bounded and infinite. 
Show that there is a point p e R which is infinitely close to, but different 
from, some member of “A, Suggestion: Let S : N— A with S one-to-one; 
look at *.S(x) for infinite x € *N. l 


6. (a) Show that *Q has cardinality at least 2*0, where Q is the set of 
rational numbers. Suggestion: Use Exercise 1. 
(b) Show that *N has cardinality at least 2%», 


7, Let A be a subset of R having no greatest member. Then as a subset 


of *R, A will have upper bounds (with respect to the ordering *<) in *R, 


But show that A does not have a least such bound. 


CHAPTER THREE 


Undecidability 


§3.0 NUMBER THEORY 


In this chapter we will focus our attention on a specific language, the 
language of number theory. This will be the first-order language with equality 
and with the following parameters: 


V, intended to mean “‘for all natural numbers.” (Recall that the set M 
of natural numbers is the set {0, 1, 2, ...}.) 

0, a constant symbol intended to denote the number 0. 

S, a one-place function symbol intended to denote the successor function 
S: N— N, i.e., the function for which S(”) = n + 1. 

<, a two-place predicate symbol intended to denote the usual (strict) 
ordering relation on N. 

+, °, E, two-place function symbols intended to denote the operations 
+, +, and E of addition, multiplication, and exponentiation, respectively: 


We will let Ñ be the intended structure for this language. Thus we may 


informally write 
R= (N, 0, S, <, +, +, E). 


(More precisely,’| N| = N, 0® = 0, etc.) 
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| By number theory we mean the theory of this structure, Th R. As warming- 


up exercises we will study (in Sections 3.1 and 3.2) certain reducts of ®, 
ie., restrictions of N to sublanguages: 


Ny = (N, 0, S), 
Ny, = (N, 0, S, <), 
Na = (N, 0, 5, <, +). 


Finally, in Section 3.7 we will consider 
Nar re (N, 0, S, <, +, ‘), 


A 
For`each of these structures we will raise the same questions: 


(A) Is the theory of the structure decidable? If so, what is a nice set of 
axioms for the theory? Is there a finite set of axioms? 

(B) What subsets of N are definable in the structure? 

(C) What do the nonstandard models of the theory of the structure look 
like? (By “nonstandard” we mean “not isomorphic to the intended struc- 
ture.’”) 


Our reason for choosing number theory (rather than, say, group theory) 
for special study is this: We can show that a certain subtheory of number 
theory is an undecidable set of sentences. We will also be able to infer that 
any satisfiable theory which is at least as strong as this fragment of number 
theory (e.g., set theory) must be undecidable. In particular, such a theory 
cannot be both complete and axiomatizable. 

In order to show that our subtheory of number theory is undecidable, 
we will show that it is strong enough to represent (in a sense to be made 
precise) facts about sequences of numbers, certain operations en numbers, 
and ultimately facts about decision procedures. This last feature then lets 
us perform a diagonal argument that demonstrates undecidability. 

We could alternatively use, in place of a subtheory of number theory, 
some other theory (such as a fragment of the theory of finite sets) in which 
we could conveniently represent facts about decision procedures. 

Before giving examples of the expressiveness of the language of number 
theory, it is convenient to introduce some notational conventions. As a 
Concession to everyday usage, we will write 


x<y, x+y, xey, and xEy 


in place of the official 


<xy, Hy, «xy, and Exy. 
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For each natural number k we have a term S*0 (the numeral for k) which 
denotes it 


S = 0, S10 = S0, S20 = SSO, etc. 


(The set of numerals is generated from {0} by the operation of prefixing S.) 


The fact that every natural number can be named in the language will be 
a useful feature. . 

Even though only countably many relations on N are definable in M, 
almost all the familiar relations are definable. For example, the set of primes 
is defined in N by 


V, AE S'O AV 9, VUCO, = Vy e Vg > Va ~ SIO v va = S10). 
Later we will find it important to show that many other specific relations 
are definable in J. 

One naturally expects the expressiveness of the language to be severely 
restricted ‘when some of the parameters are omitted. For example, the set 
of primes, as we shall see, is not definable in R4. On the other hand, in Section 


3.7 we will show that any relation definable in ® is also definable in Ny. 


Preview 


The main theorems of this chapter are proved in Section 3.5. But we 
can already sketch here some of the ideas involved. We can assign to each 
formula o of the language of number theory an integer fæ, called the Gödel 
number of œ. Any straightforward way of assigning distinct integers to 
formulas would suffice for our purposes; a particular assignment is described 
at the beginning of Section 3.4. Similarly, to each finite sequence D of for- 
mulas (such as a deduction) we assign an integer YD), We can now state 
the following result, which asserts that there is no definable (in N) complete 
axiomatization of Th %. 


Theorem 30A Let A S Th Ñ be a set of sentences true in N, and assume 
that the set {fæ :œ € A} of Gödel numbers of members of A is a set de- 
finable in N. Then we can find a sentence ø such that o is true in LM but. ¢ 
is not deducible from A. 


Proof We will construct o to express (in an indirect way) that ø itself 
is not a theorem of A. Then the argument will go roughly as follows: If 
A } g, then ø is false, contradicting the fact that A consists of true sentences: 
And so'A f o, whence ø is true. 


— rr 
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To construct o, we begin by considering the ternary relation R defined 
by 


<a, bh, c> € R iff a is the Gödel number of some formula œ and c is the 
value of Z at some deduction from A of a(S°0), 


Then because {ita :@ e A} is definable in M, it follows that R is definable 
also. (The details of this step must wait until later sections.) Let @ be a for- 
mula which defines R in R. Let r be the Gödel number of 


Yv 10(01, 01, Y3). 
(We use here the notation of page 157.) Then let o be 
Vv, —0(S’0, SO, v5). 


Thus o says that no number is the value of F at a deduction from A of 
formula number r in which the numeral for r has replaced v4; i.e., no number 
is the value of at a deduction of o. 

Suppose that, contrary to our expectations, there is a deduction of o ` 
from A. Let k be the value of Z at a deduction. Then <r, r, ky € R and 
hence 

Ey e(S'0, S70, S*0). 
It is clear that 
o | 9S, SO, S*0) 


and the two displayed lines tell us that o is false in N. But A | ø and the 
members of A are true in N, so we have a contradiction. 

Hence there is no deduction of o from A. And so for every k, we have 
<r, r, kò ¢ R. Thus for every k 


Ly ~eCS0, SO, S*0), 
from which it follows (with the help of the substitution lemma) that 


Ex Vv, “0(S’0, S0, va); 


Le, o is true in 3. 


` We will argue later that any decidable set of natural numbers must ‘be 
definable in N. The conclusion will then be that Th N is not axiomatizable. 


© Corollary 30B The set {rv : Hyr} of Gödel numbers of sentences 
true in N is a set which is not definable in N. 
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Proof If this were definable, we could take A = Th N in the preceding 


What can be said of an. arbitrary model 
theorem to obtain a contradiction. 


N = (|U |, 0%, S%) 


$3.1 NATURAL NUMBERS WITH SUCCESSOR of the axioms Ag? S% must be a one-to-one map of |N | onto |X| — {0%}, 


by S1, S2, and S3. And by $4.7, there can be no loops of size n. Thus |% | 


We begin with a situation which is simple enough to let us give reason- must contain the “standard” points: 


ably complete answers to our questions. We reduce the set of parameters 

to just Y, 0, and S, eliminating <, +, +, and E. The corresponding reduct l 0% — SA(0%) > SUSO) — +, 

of N is 
which are all distinct. The arrow here indicates the action of S*. There 
may or may not be other points. If there is another point a in |A |, then 
there will be the successor of a, its successor, etc. Not only that, but since 
(by S3) each nonzero element has a predecessor (something of which it is 
the successor) which is (by S2) unique, | % | must contain the predecessor of 
a, its predecessor, etc. These must all be distinct lest there be a finite loop. 

. Thus a belongs to a “Z-chain”: 


Ns = (N, 0, S). 


In this restricted language we still have the numerals, naming each point 
in N. But the sentences we can express in the language are, fro: om the view- 
point. of arithmetic, uninteresting. 

We want to ask about Rs the same questions that interes: us in the case 
of N. We want to know about the complexity of the set Th Ne; we want to 
study definability in Ng; and we want to survey the nonstandard models 
of Ny. 

To study the theory of the natural numbers ith successor (Th Ns), 
we begin by listing a few of its members, i.e., sentences true in Ng. (These 
sentences will ultimately provide an axiomatization for the theory.) 


koe a> Sa) — S*”S*@) > +: 


(We refer to these as Z-chains because they are arranged like the set Z of 
all integers {..., —1, 0, 1, 2,...}.) There can be any number of Z-chains. 
But any two Z-chains must be disjoint, as S2 prohibits merging. Similarly, 
any Z-chain must be disjoint from the standard part. 

This can be restated in another way. Say that two points a and b in |A | 
are equivalent if the function S* can be applied a finite number of times 
to one point to yield the other point. This is an equivalence relation. (It 
is clearly reflexive and symmetric; the transitivity follows from the fact 
that S% is one-to-one.) The standard part of |Ù | is the equivalence class 
containing 0%. For any other point (if any) a in | |, the equivalence class 
of a is the set generated from {a} by S% and its inverse. This equivalence 
class is the Z-chain described above. 

Conversely, any structure B (for this language) which has a standard part 


S1. Wx Sx 440, a sentence asserting that zero has no predecessor. 
l 52. Wx Wy(Sx ~ Sy ~> x = y). This asserts that the successor function 
is One-to-one. 

53. Vy(y a2 0 — dx y = Sx). This asserts that any nonzero number 
is the successor of something. l 

54.1 Wx Sx ay x. 

54.2 Wx SSx ay x. 


S4.n Wx Sx ay x, where the superscript n indicates that the symbol 5 
occurs at n consecutive places. 


Let Ag be the set consisting of the above sentences S1, S2, S3, S84.” 
(n = 1,2, ...). Clearly these sentences are true in Ng; i.e., Ny is a model 
or As. Hence ; : and a nonstandard part consisting. of any number of separate Z-chains, 

Cn Ag S Th Rg. is a model of Ag. (Check through the list of axioms in Ag, and note that 
each is true in 8.) We thus have a complete characterization of what the 
models of Ag must look like. l 

Ifa model% of Ag has only counthbly many Z-chains, then |% | is count- 

able. In general, if the set of Z-chains has cardinality A, then altogether 


08 Ley §%(0%) = S%S%(0%)) sede s oe 


Ainiai aa ssid aasa aiana iiaii hlada iaa aaancha anaiai iii aanrennen aii: 


(Anything true in every model of Ag is true in this model.) What is not so 
obvious is that equality holds. We will prove this by considering arbitrary _ 
models of: Ag. 


Na Uaa anaes 
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the number of points in |Y | is $o + $, < 4. By facts of cardinal arithmetic 
(cf. Chapter 0) this number is the larger of & and A. Hence 


E N, if ù has countably many Z-chains, 
i ~ YA if Ù has an uncountable number A of Z-chains, 


Lemma 31A If 2 and Ù are models of Aş having the same number 
of Z-chains, then they are isomorphic. 


Proof There is a unique isomorphism between the standard part of Ñ 
and the standard part of 2’. By hypothesis we are given a one-to-one 
correspondence between the set of Z-chains of Ñ and the set of Z-chains 
of Ñ’; thus each chain of Ù is paired with a chain of W. Clearly any two 
Z-chains are isomorphic. By combining all the pieces (which uses the axiom 
of choice) we have an isomorphism of Y onto w’. a 


Thus a model of Ws is determined to within isomorphism by its number 
of Z-chains. For Ng this number is zero, but any number is possible. 

The reader should note that there is no sentence. of the language which 
says, “There are no Z-chains.” In fact, there is no set X of sentences such 
that a model A of Ag satisfies X iff X has no Z-chains. For by the LST 
theorem there is an uncountable structure WX with Ñ = Ns. But W has 
uncountably many Z-chains and ts has none. 


Theorem 31B Let% and % be uncountable models of Ag of the same 
cardinality. Then Y is isomorphic to %. 


Proof By the above discussion, Ñ has card UW Z-chains, and %9 has 
card 8 Z-chains. Since card X = card Y, they have the game number of 
Z-chains and hence are isomorphic. = 


Theorem 31C Cn As is a complete theory. 


. Proof Apply the Łoś-Vaught theorem of Section 2.6. The preceding 
theorem asserts that the theory Cn Ag is categorical in any uncountable 
power. Furthermore, Ags has no finite models. Hence the Log-Vaught 
theorem applies. a 


Corollary 31D Cn As = Th Mg. 


Proof We have Cn As S Th Ms; the first theory is complete and the 
second is satisfiable. a 


| 
' 
i 
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*Corollary 31E Th Ng is decidable. 


Proof Any complete and axiematizable theory is decidable (by Corol- 
lary 25G). Ag is a decidable set of axioms for this theory. 


Elimination of quantifiers 


Once one knows a theory to be decidable, it is tempting to try to find a 
realistically practical decision procedure. We will give such a procedure 
for Th Ns, based on “elimination of quantifiers.” 


Definition A theory T admits elimination of quantifiers iff for every 
formula p there is a quantifier-free formula » such that 


T F =y). 
Actually it is enough to consider only formulas 9 of a rather special form: 
Theorem 31F Assume that for every formula » of the form 
Ax(a ‘ “++ AQa), 


where each œ; is an atomic formula or the negation of an atomic formula, 
there is a quantifier-free formula y such that T H (p+ y). Then T admits 
elimination of quantifiers. 


Proof First we claim that we can find a quantifier-free equivalent for 
any formula of the form Ix 6 for quantifier-free 0. We begin by putting 0 
into disjunctive normal form (Corollary 15C). The resulting formula, 


Ax[(o A +t A Am) V Bod +++ ABV ++ V Eg ee Aa ED, 


is logically equivalent to 


Fx (wo A+++ A Am) V AxX(By A+++ ABa Y «++ y Ax(Ey A o> 


A È). 


By assumption, each disjunct of this formula cdn be replaced by a quanti- 
_fler-free formula. l i 
We leave it to the reader to show (in Exercise 2) that by using the above 
Paragraph one can obtain a quantifier-free equivalent for an arbitrary for- 
mula, Ea a : 


In the special case where the theory in question is the theory Th of a 
Structure Y, the definition can be restated: ThY admits elimination of 
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2. Complete the proof of Theorem 31F. 


3. The proof of quantifier elimination for Th Ns showed how, given a 
formula g, to find a quantifier-free y. Show that 


As Ey) 


without using the completeness of Cn Ag. (This yields an alternative proof 
of the completeness of Cn Ag, not involving Z-chains or the Los-Vaught 
theorem.) 


4, Show that a subset of N is definable in Ng iff either it is finite or its 
complement (in N) is finite. 


5, Show that the ordering relation {<m, ny :m <n in N} is not defin- 
able in Ny. | 


6. Show that Th Ns is not finitely axiomatizable. Suggestion: Show that 
no finite subset of Ag suffices, and then apply Section 2.6. 


§3.2 OTHER REDUCTS OF NUMBER THEORY! 


First let us add the ordering symbol < to the language. The intended 
structure is 


Ny, = (N, 0, S, <). 


We want to show that the theory. of this structure is (like Th Ns) decidable 
and also admits elimination of quantifiers. But unlike Th Ng, it is finitely 
axiomatizable and is not categorical in any infinite power. 

As axioms of Th Xs we will take the finite set Az consisting of the six 
sentences listed below. Here x < y is, of course, an abbreviation for (x < Y 
v x= y), and xÆ y abbreviates the negation of this formula. 


83. Vy O 58 0 —> Ax y ~ Sx) 
LI. VxVy (<Syexsy) 

L2. Vx x £0 

L3. VWxVy (x<yyxæyvyygx) 
LA. i Yx Vy (œx <y >y Kx) 

L5. Wx Vy Wz (x < y =y < Z= x < z) 


t This section may be omitted without disastrous effects, 


RENETI ahs aaan iiihanaa aii aiaia 
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We begin by listing some consequence of these axioms: 
(1) Ay H Vx x< Sx. 

Proof In L1 take y to be x. 

(2) A, LWxx Ex. 

Proof In L4 take y to x. E] 
(3) Ar H Yx Ve fyeys x) (trichotomy). 

Proof For “=>” use L3. For “‘<~” use L4 and (2). 5 
(4) Ar Vx Yy < y 4 Sx < Sy). 


Proof From Ár we can deduce the biconditionals: 


x<yory fx by (3); 
ee yp ESX by Ll; 

<> 8x Sy by (3); 

«> Sx < Sy by Li. 


(5) A, S1 and Az} 82. 


Proof S1 follows from L2 and (1). 52 comes from (4) by use of L3 
and (2). a 
(6) Arn | S4.n for n = 1,2,.... 
Proof This follows from (1) and (2), using L5. C 


Thus any model Ù of Az, is (when we ignore <%) also a model of Ag. 
So it must consist of a standard part plus zero or more Z-chains. In addition, 
it is ordered by <™. 


Theorem 32A The theory Cn Az, admits elimination of quantifiers. 
Proof Again we consider a formula 
Ax(Bo AeA Bo) 


where each £; is atomic or the negation of an atomic formula. The terms 


are, as in Section 3.1, of the form Sty, where u is 0 or a variable. There are 
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two possibilities for atomic formulas, 
Stu~ St and Stu < Sit, 


1. We can eliminate the negation symbol. Replace f € tg by h €h 
Vt, =f, and replace 4A t, by h < tY t<. (This is justified by 
L3 and L4.) By regrouping the atomic formulas and noting that 


Ixy y y) FS Axe y axy, 
we may again reach formulas of the form 
Ax(ay A+++ AQ), 


where now each a; is atomic. 


2. We may suppose that the variable x occurs in each a;. This is because 
if x does not occur in œ, then 


Jx(æ AB) H4 æa Ax Bp. 


Furthermore, we may suppose that x occurs on only one side of the equality 
or inequality œ;. For S*x ~ S'x can be dealt with as in Section 3.1. Stx 
<S'x can be replaced by 0 ~ 0 if k <l, and 00 otherwise. (This is 
justified by (4) and L2, together with the observation that 4, }0< x.) 


Case 1: Suppose that some œ; is an equality. Then we can proceed as 
in case 2 of the quantifier-elimination proof of Theorem 31G. 
‘Case 2: Otherwise each œ; is an inequality. Then the formula can be re- 
written 


ax(/ ti < Sx N Stix < w) ; 
i j 


(Here ‘A; indicates the conjunction of formulas indexed by i, so Yo A 41 
A!» Ayp can be abbreviated A;y:) In the first conjunction, Acti 
< Sx, we have the lower bounds on x, in the second conjunction, A ;S”* 
< uj, we have the upper bounds. If the second conjunction is empty (1.¢., 
if there are no upper bounds on x), then we can replace the formula by 
0 =~ 0. (Why?) If the first conjunction is empty (i.e., if there are no lower 
bounds on x), then we can replace the formula by 


A S0 < U5, 


which asserts that zero satisfies the upper bounds. Otherwise, we rewrite _ 


SESE SABES EES BIE SIRE A ca ca Rc NR ORE ee- 
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the formula successively as 


(1) dx N (h < Sr a Sux < uj). 
tJ : 
(2) dx A (St; < SUNY << S™u;). 
ij 
(3) ( A Sts < sm) A \ SO < uj. 
tJ > z 


This last formula says “any lower bound plus one satisfies any upper bound, 
and furthermore zero satisfies any upper bound.” This implies that there 
is a gap between the greatest lower bound and the least upper bound, 
whence there is a solution for x. The second part guarantees that the solution 
for x is not forced to be negative. 

In each case, we have arrived at a quantifier-free version of the given 
formula. a 


Corollary 32B (a) Cn Az, is complete. 
(b) Cn Ay = Th My. 
*(c) Th N; is decidable. 


Proof (a) The argument which followed the proof of Theorem 31G 
is applicable here also. (b) This follows from (a), since Cn Az S Th Nz 
and Th N; is satisfiable. For (c), we can use the fact that any complete 
axiomatizable theory is decidable. But the quantifier elimination proof yields 
a more efficient decision procedure. a 


Corollary 32C A subset of N is definable in Nz iff it is either finite or 
has finite complement. 


' Proof Compare Exercise 4 of the preceding section. 


On the other hand, Nz has more definable binary relations than has My. 
For the ordering relation {<m, n> : m < n} is not definable in Ns, by Exer- 
cise 5 of the preceding section. 


Corollary 32D The addition relation, 
{<m, n, p) :m+n= P}, 
is not definable in Nz. 


Proof If we could define addition, we could then define the set of even 
natural numbers. But this set is neither finite nor has finite complement. M 
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Now suppose we augment the language by the addition symbol +. The 
intended structure is 


Na == (N, 0, S, <, +). 


The theory of this structure is also decidable, as we will prove shortly. 
But to keep matters from getting even more complicated, we will avoid 
listing any convenient set of axioms for’ the theory. 

The nonstandard models of Th N, must also be models of Th Nz. So 
they have a standard part, followed by some Z-chains. But ordering among 
the Z-chains can no longer be arbitrary. Let Uf be a nonstandard model of 
ThNa. The ordering <% induces a well-defined ordering on the set of 
Z-chains. (See Exercise 3.) We claim that there is no largest Z-chain, there 
is no smallest Z-chain, and there is between any two Z-chains another one. 
The reasons, in outline, can be stated simply: If a belongs to some Z-chain 
(i.e. is an infinite element of Y), then a +" a is in a larger Z-chain. There 
must be some b such that b +% b is either a or its successor; b must be in 
a smaller Z-chain. If a, and a, belong to different Z-chains, then there must 
be some b such that b + b is either a, -+% a, or its successor. And b will 
lie in a Z-chain between that of a, and that of a,. (These statements should 
seem quite plausible. The reader who enjoys working with infinite numbers 
might supply some details.) l 


*Theorem 32E (Presburger, 1929) The theory of the structure Na 
= (N,0, S, <, +) is decidable. 


The proof is again based on a quantifier elimination procedure. The theory 
of Ny itself does not admit elimination of quantifiers. For example, the for- 
mula defining the set of even numbers 


dyy~yty 


is not equivalent to any quantifier-free formula. We can overcome this by 
adding a new symbol 2%, for congruence modulo 2. Similarly, we add sym- 
bols z, 4, ... . The intended structure for this expanded language is 


Rt = (N, 0, S, <, +, =; =;, Kh 


where ==} is the binary relation of congruence modulo k. It turns out 
that the theory of this structure does admit elimination of quantifiers. 

This by itself does not imply that the theory of either structure is decid- 
able. After all, we can start with any structure, and expand it to a structure 
having additional relations until a structure is obtained that admits elimina 
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tion of quantifiers. To obtain decidability, we must show that we can, 


given a sentence o, (1) effectively find a quantifier-free equivalent o’, and 


-then (2) effectively decide if o’ is true. 


We will now give the quantifier elimination procedure for N+. For a 
term ¢ and a natural number n, let nt be the term ¢ +f +++) -+ z, with 


n summands. Of is 0. Then any term can be expanded to one of the form 


S0 -H nix bors YX, 
for k > 0, n; > 0. For example, 


S(x + 50) 
becomes 
s0 4+- x. 


As usual we begin with a formula Ay(6, A 


-+» A Pn), where f; is an 
atomic formula or the negation of one. 


i. Eliminate negation. Replace (h = t) by A<&VhAa< 4). Re- 
place (4 < t) by (G = ta V b < h). And replace —(t, Sm t2) by 


ty in te t SIO v +++ V ty Rein fe t gmg, 


Then regroup into a disjunction of formulas of the form 


Jyo Art A Om)» 


where each oœ; is atomic. We may further suppose that y occurs in each o, 
and in fact that a; has one of the four forms 


ny +t u, 

ny +t Rn, 

ny i< u, 
u< ny +t, 


where u and ¢ are terms not containing y. In what follows we will take 
the liberty of writing these formulas with a subtraction symbol: 


ny = u—t, 
AY Fn U t, 
ny < u-t, 


-u i< ny. 
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These are merely abbreviations for the formulas without subtraction ob- 
tained by transposing terms. 
For example, we might have at this point the formula 


dpQv <4ya2y<ua3y Lv A yZ t), 


where ¢, u, v, and w are terms not containing y. 

2. Uniformize the coefficients of y. Let p be the least common multiple 
of the coefficients of y. Each atomic formula can be converted to one in 
which the coefficient of y is p, by “multiplying through” by the appropriate 
factor. This is obviously legitimate for equalities and inequalities. In the 
case of congruences one must remember to raise the modulus also: 


a =nb iff ka =,,, kb. 


In the example above p is 12, and we obtain 


AyGw < 12y A lay < bun 12y < 4v A 12y Begg 12t). 


3. Eliminate the coefficient of y. Replace py by x and add the new 
conjunct x %0. (In place of Ay +++ 12y +++ we can equally well have, 
“There exists a multiple x of 12 such that ...x 
converted to 


IxBw <x ax < 6urnx < 4v A X Bigg 12t A x fZ 9). 


4, Special case. If one of the atomic formulas is an equality, x + f = u, 
then we can replace 
dx 6 
by 
OZ at <u. 


Here replacement of x by “u — t” is the natural thing; we transpose terms 
to compensate for the absence of subtraction. For example, 


(X Rn. iS URZ bh 


5. We may assume henceforth that ~ does not occur. So we have 4 
formula of the form 


Ax[ro — S KK A Afa Sy <x 


AX Sly —UQ A+ "AX < thy T Up_y 


AX Im Vo — Woh tts AX Finn Vay T Waal, 


..”) Our example is now 


FEBS SS 


Se ee e 
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where Fi, Sis fi, Ui, Vis and w; are terms not containing x. This can be 
abbreviated 
Ax| A r;—sj3<ixn Ax<t;—uja A ¥ Bing Yi ~ wil. 
j<i i<k i<n 
If there are no congruences (i.e., n = 0), then the formula asserts that 


there is a nonnegative space between the lower and upper bounds. We 
can replace the formula by the quantıfier-free formula: 


A A (tj — s) + 80 < t; — ua A0<t-— 4; 
i<k j<l i<k 
Let M be the least common multiple of the moduli mo, ..., Mp- 
Then a + M =p a. So as a increases, the pattern of residues of a modulo 
Mo, . ++, Mp has period M. Thus, in searching for a solution to the con- 


gruences, we need only search M consecutive integers. 

We now have a formula which asserts the existence of a natural number 
which is not less than certain lower bounds L,, ..., 2; and which satis- 
fies certain congruences and certain upper bounds. If there is such a sos 
lution, then one of the following is a solution: 


£441, yl + M-l, 
La l +1, oolp M-I, 
Pied vey Ly + M— 41, 
(iced. 


(The last line is needed to cover the case in which every L; is negative. 
To avoid having to treat this line as special case, we will add a new lower 
bound of 0. That is, let r, = 0 and s; = SO so that 


ry Sp 


is a formula 0 < x -+ SO asserting that x is nonnepative, We now have 
I++ 1 lower andes 
Our formula (asserting the existence of a solution for x) can now be 
replaced by a quantifier-free disjunction which asserts that one of the num- 
bers in the above matrix is a nonnegative solution: 


Arosa 


jsl q<M [A 


a N j- 


i<n 


ga 5;) of SMA Ae Sj) -- S10 < f; — Uy 
i i<k 


- 83) + S0 Sm; ti T v= wi, 
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In our continuing example wè have, after adding the new lower bound 


on x, 


AxQw <x A0 <x + SOA x < 6u AX <4 AX Begg 12t A x Rie 0). 


The quantifier-free equivalent is a disjunction of seventy-two conjunctions. 
Each conjunction has six constituents. 

This proves half of the theorem. If we are given a sentence o, the above 
procedure tells us how to find effectively a quantifier-free sentence + (in 
the language of Nt) which is true (in the intended structure) iff o is. Now 
we must decide if v is true. 

But this is easy. It is enough to look at atomic sentences. Any variable- 
free term can be put in the form S”0. Then, for example, 


S"0 Ay S70 
is true iff n =,, p. . 


A set D of natural numbers is periodic if for some positive p, a number 
n is in D iff n + p is in D. D is eventually periodic iff there exist positive 
numbers M and p such that for all n greater than M, n € D ifn + pe D. 


Theorem 32K A set of natural numbers is definable in (N, 0, S, <, +) 
iff it is eventually periodic. 


Proof Exercise 1 asserts that every eventually periodic set is definable. 
Conversely, suppose D is definable. Then D is definable in N+ by a quan- 
tifier-free formula (whose only variable is v,). Since the class of eventually 
periodic sets is closed under union, intersection, and complementation, it 
suffices to show that every atomic formula in the language of Nt whose 
only variable is », defines an eventually periodic set. There are only four 
possibilities: i ; 
nv t é= u, 
ny +i< u, 

uZ nv, +f, 


nv, + É Ry, 4, 


where u and ¢ are numerals. The first two formulas define finite sets (which _ 


eventually have period 1), the third defines a set with finite complement, 
and the last formula defines a periodic set with period m. 


i 
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Corollary 32G The multiplication relation 
{m,n py :p=m-n in N} 
is not definable in (N, 0, S, <, +). 


Proof If we had a definition of multiplication, we could then use that . 
to define the set of squares. But the set of squares is not eventually pe- 


‘riodic. a 


EXERCISES 


1. Show that any eventually periodic set of natural numbers is defin- 
able in the structure N4. 


2, Show that in the structure (N, +) the following relations are defin- 
able: 

(a) Ordering, {<m, n> 1m <n}. 

(b) Zero, {0}. 

(c) Successor, {<m, n> : n = S(m)}. 


3. Let X be a model of Th N, (or equivalently a model of 4z). For a 


and bin |Y | define the equivalence relation: 


a ~b <8" can be applied a finite number of times to one of a, b to 
reach the other. 


Let [a] be the equivalence class to which a belongs. Order equivalence classes 
by 
la) < [b] iffa<™b and ab. 


Show that this is a well-defined ordering on the set of equivalence classes. 


4. Show that the theory of the real numbers with its usual ordering, 
TA(R, <), admits elimination of quantifiers. (Assume that the language 
includes equality.) 


§3.3 A SUBTHEORY OF NUMBER THEORY 


We now return to the full language of number theory, as described in 
Section 3.0. The parameters of the language are Y, 0,85, <, +, °, and ` 


E. The intended structure for this language is 


R= (N, 0, S, Sh E). 
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Actually in (N, +, E) we can define {0}, S, <, and +. (See Exercise 1.) 
As we will show in Section 3.7, in (N, +, +) we can define E as well as 0, 
S, and <. So there are ways in which we could economize. The luxury 
of having all these parameters (particularly E) will simplify some of the 
proofs. Ms 

As we shall see, Th ® is a very strong theory and is neither decidable 
nor axiomatizable. In order to prove this fact (and a number of related re- 
sults) it will be strategically wise to select for study a finitely axiomatizable 
subtheory of Th N. As hinted at in Section 3.0, this subtheory should be 
strong enough to represent (in a sense.to be made precise) fects «bout 


decidable sets. The subtheory we have selected is Cn Ag, where Ap is the’ 


set consisting of the eleven sentences listed below. (As in the preceding 
section, x Z y abbreviates x < y V x = y.) l 


SET Ay OF AXIOMS 


Sl. Vx Sx bo 

52. Wx Vy (Sx = Sy > x = y) ` 
LI. Vx Vy (x < Sy x < y) 
L2. Yx x0 

L3, Vx Vy xcyvx~yvy <x) 
Al, Vx x«+0~x 

A2. Vx Vy x + Sy ~ S(x + y) 
Mi. Yx x°-0+0 

M2. YxVyx.Sy=x.y +x 
El. Wx x E0 ~ S0 

E2. YxVyxESy=~xEy.x 


Since N is a model of Ag, we have Cn Ag S Th 9. But (as we will prove) 
equality does not hold here. In fact, it can be shown that Ag / 53. 

We first show that certain simple sentences in Th % are deducible 
from Ag. 


Lemma 33A (a) Ag | Vxx £0. 
(b) For any natural number k, 


Ag HY x(x < SEO 4 x =~ Sv +++ vx ~ S*0). 
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Notice that (a) can be thought of as the k = —1 case of (b), where the 


empty disjunction is L. The lemma tells us that the standard points are 


ordered in the natural way, and (by L3) the infinite points are all larger 
than any standard point. 


Proof Part (a) is L2. For (b) we use induction (in English) on k. We 
have as a consequence of L1, i 


x < S04 x <0 v x= l, 


which together with L2 gives 


x < 80 <4 x = 0, 


which is the k = 0 case of (b). For the inductive step we again apply LI: 
x < SHO < x < S40 v x ~ S40. 
By the inductive hypothesis, x < S*0 can be replaced by 
x= SOV) vx ~ S10, 


whereby we obtain (b). E 


Lemma 33B For any variable-free term f, there is a unique natural 


number n such that 
Ay H t == S70. 


Proof The uniqueness is immediate. (Why?) For the existence half, 
we use induction on ż. It fis 0, we take n = 0. If f is Su, then by the inductive 
hypothesis Ay} u~S"0 for some m. Hence Ag} t= S™*0, 

Now suppose f is u, + us. By the inductive hypothesis Ag |} £ = S”0 
-+ S"0 for some m and n. We now apply A2 n times and A1 once to obtain 
Ay | t = S79, The arguments for multiplication and exponentiation are 
similar. 


As a special case of this lemma we have “2 + 2 = 4” (i.e, S?O -+ S70 
= S10) as a consequence of Aj. 


Theorem 33C For any quantifier-free sentence t+ true in N, Ay} rt. 
Proof Exercise 2. » 


A simplified notation (used earlier in Section 2.7) for substitution will 
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be helpful in the coming pages: 


pt) = ot; 
P(t, to) = (ptt a 


and so forth. Thus » = 9(v,) = y(v,, v2). Usually the term substituted 
will be a numeral, for example 


P(S%0, S°0) = (Pho) So: 


But at times we will also substitute other terms, e.g., g(x) = g$, where 
x is a variable. If, however, x is not substitutable for », in y, then we must 
take v(x) = yz, where y is a suitable alphabetic variant of g. 

In the next proof (and elsewhere in this chapter) we make use of the 
following consequence of the substitution lemma of Section 2.5: For a 
formula g in which at most v,, ..., », occur free and for natural numbers 
COEN: om 


Fn? lo, teeny an] > Eg p(S%0, ay S2n9). 


An existential (3,) formula is one of the form 3x, +++ 3x, 0, where 6 
is quantifier-free, The following result improves Theorem 33C: 


Corollary 33D If r is an existential sentence true in R, then Ap | t. 
Proof If 3v, Av, 9 is true in 9, then for some natural numbers m and ^, 


O(S™0, S”0) is true in N. As this is a quantifier-free true sentence, it is de- 
ducible from Ag. But it in turn logically implies Av, 3v, 6. a 


On the other hand, it is known that there are true universal (Y4) sentences 
(i.e, of the form Vx, -+- Vx, 0 for quantifier-free 0) which are not in 
Cn Ag. 


Representable relations 


Let R be an m-ary relation on N; i.e., R S N”. We know that a formula 

e (in which only v,, ... , Ya occur free) defines R in N iff for every aj, ... 5 Om 

in N, ‘ 
<a, A sAm? E R< ZICH sse » Am] 

<> Fy (S40, ... , S0). 


(The last condition here is equivalent to the preceding one by the sub- 
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stitution lemma.) We can recast this into two implications: 


lli, ++) On E R = Fne(SO, ..-, Sn), 
dy, o. Amy E R= Fe S0, ..., S%0). 


We will say that ọ also represents R in the theory Cn A, if in these two 
implications the notion of truth in 9% can be replaced by the stronger notion 


of ‘deducibility from Ag. . . 
More generally, let T be any theory in a language with 0 and S. Then 


o represents R in T iff for every a1, ..+> Om in N: 
(dys 0+ +5 Om E R= (SO, ..., S0) € T, 
llis 205 Am E R = (S0, ... , S%0)) E T. 


For example, ọ represents R in the theory Th N iff ọ defines R in N. But e 
represents R in Cn Apg iff for all ay, .. -s Om: 


llis ++) Am? E R = Apg | (80, <a SmO), 
lai, see , Am? € R => Ag H —=0(S%0, eee , S20). 


The equality relation on N, for example, is represented in Cn Ag by the 
formula.v, = V. For 


m=n=>|+S"0 ~ 80, 
m Æ n = {SI, S2} H 7S"0 ~ 870. 


A relation is representable’in T iff there exists some formula which repre- 
it in T. 

ies aan of representability should be compared with that of defin- 
ability. In both cases we are somehow describing relations on the ce 
numbers by formulas. In the case of definability, we ask about the trut 
of sentences in the interpretation. In the case of -representability in Cn Ag, 
we ask instead about the deducibility of sentences from the axioms. 

Say that a formula g, in which no variables other than v1, ..., Um ee 
free, is. numeralwise determined by Ay iff for any m-tuple a), ...,4m O 
natural numbers, either 


Ag H (S20, ..., 8%) 


al a 
Ap + —=g(S70, on ey S09). 
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Theorem 33E A formula ọ represents a relation R in Cn A, iff 


(1) @ is numeralwise determined by Az, and 
(2) @ defines R in K. l 


Proof We use the fact that Ñ is a model of Ap. If ọ represents R in 
Cn Ap, then it is clear that (1) holds; (2) holds since “4y H” implies “Hg.” 
Conversely, if (1) and (2) hold, then we have 


Calis eey Am? E R = Fy o(S%0, ... , S0) by (2) 
= Ág - 70(S%0, ... , S0) since M is a model of Ag 
=> Az H 0(S@0, ... , S0) by (1). 

Similarly for the complement of R and —o. m 


Church’s thesis 


We now turn to the relationship of the concepts of representability and 
decidability. 


*Theorem 33F Assume that R is a relation representable in a consistent 
axiomatizable theory. Then R is decidable. 


Proof Say that @ represents R in the consistent axiomatizable theory T. 
Recall that T is effectively enumerable (Corollary 25F). The decision pro- 
cedure is as follows: 


Given a,,...,@,, enumerate the members of T. If, in the enumera- 
tion, e(S%0, ... , SO) is found, then we are done and <a}, ..., dm E R. 
If, in the enumeration, -e(S“0, ..., S%0) is found, then we are done 

and <a,,...,a,.) ÈR. i 

By the representability, one sentence or the other always appears even- 
tually, whereupon the procedure terminates. Since T is consistent, the answer 
given by the procedure is correct. a 


*Corollary 33G Any relation representable in a consistent finitely 
axiomatizable theory is decidable. 


What about the converse to the above corollary? We cannot really hope 
to prove the converse on the basis of our informal notion of decidability: 
For our informal approach is usable only for giving lower bounds on the 
class of decidable relations, and is unsuited to giving upper bounds. 


It is nevertheless possible to make plausibility arguments in support 


| 
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of the converse. This will be easier to do at the end of Section 3.4 than here. 
Roughly, the idea is that in a finite number of axioms we could ca 
(finitely long) instructions for the decision procedure. 

The assertion that both the above corollary and its converse are correct 
is generally known as Church’s thesis. This assertion is not really a mathe- 
matical statement susceptible of proof or disproof; rather it is a judgment 
that the correct formalization of the intuitive notion of decidability is ‘by 
means of representability in consistent and finitely axiomatizable theories, 


pture the 


Definition A relation R on the natural numbers is recursive iff it is rep- 


resentable in some consistent finitely axiomatizable theory (in a language 
with © and S). . 


Church’s thesis now can be put more succinctly: A relation is decidable 
iff it is recursive. Or perhaps more accurately: The notion of recursiveness 
is the correct precise counterpart to the intuitive notion of decidability, 
The situation is analogous to one encountered in calculus. An intuitively’ 
continuous function (defined on an interval) is one whose gr 
draw without lifting your pencil off the paper. But to prove theorems, some 
formal counterpart of this notion is needed. And so one gives the usual 
definition of e-d-continuity. One should ask if the precise notion of e-d- 
continuity is an accurate formalization of intuitive continuity. If anything, 
the class of e-d-continuous functions is too broad. It includes nowhere dif- 
ferentiable functions, whose graphs cannot be drawn without lifting the 


aph you can 


pencil. But accurate or not, the class of e-ĝ-continuous functions has been 


found to be a natural and important class in mathematical analysis. 
Very much the same situation occurs with recursiveness. One should 


ask if the precise notion of recursiveness is an accurate formalization of 


the intuitive notion of decidability. Again, the precisely defined class (of 
recursive relations) appears to be, if anything, too broad. It includes rela- 
tions for which any decision procedure will, for large inputs, require so 
much computing time and memory (“‘scratchpad’’) space as to make im- 
plementation absurd. Recursiveness corresponds to decidability in an ide- 
alized world, where length of computation and amount of memory are 
disregarded. But in any case, the class of recursive relations has been found 
to be a natural and imporant class in mathematical logic. 

Empirical evidence that the class of recursive relations is not too narrow 
is provided by the following: l 


1. Any relation considered thus far which mathematicians have felt was 
decidable has been found to be recursive. 
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Theorem 33E A formula ọ represents a relation R in Cn Ag iff 


(1) @ is numeralwise determined by Ag, and 
(2) o defines R in MN. l 


Proof We- use the fact that N is a model of Ag. If ọ represents R in 
Cn Apg, then it is clear that (1) holds; (2) holds since “Ag +” implies “Hy.” 
Conversely, if (1) and (2) hold, then we have 


(Ay, o.e s Am? E R => Fy o(S%0, ... , S20) by (2) . 
=> Ap - ~0(S%0, ... , S%0) since Nis a model of Ag 
=> Ap | 0(S@0, ... , S20) by (1). 

Similarly for the complement of R and —. | 


Church’s thesis 


We now turn to the relationship of the concepts of representability and 
decidability. 


*Theorem 33F Assume that R is a relation representable in a consistent 


- axiomatizable theory. Then R is decidable. 


Proof Say that ọ represents R in the consistent axiomatizable theory T. 
Recall that 7 is effectively enumerable (Corollary 25F). The decision pro- 
cedure is as follows: 


Given @,,...,@,, enumerate the members of T. If, in the enumera- 
tion, e(S“0, ..., S%0) is found, then we are done and <a, ...., a, E R. 
If, in the enumeration, ~e(S"0, ..., S%0) is found, then we are done 
and <a, ...,d,)> ER. 

By the representability, one sentence or the other always appears even- 
tually, whereupon the procedure terminates. Since T is consistent, the answer 
given by the procedure is correct. E 


*Corollary 33G Any relation representable in a consistent finitely 
axiomatizable theory is decidable. 


What about the converse to the above corollary? We cannot really hope 
to prove the converse on the basis of our informal notion of decidability. 
For our informal approach is usable only for giving lower bounds on the 
class of decidable relations, and is unsuited to giving upper bounds. 

It is nevertheless possible to make plausibility arguments in support 
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of the converse. This will be easier to do at the end of Section 3.4 than here. 
Roughly, the idea is that in a finite number of axioms we could capture the 
(finitely long) instructions for the decision procedure. 
The assertion that both the above corollary and its converse are correct 
is generally known as Church’s thesis. This assertion is not really a mathe- 
matical statement susceptible of proof or disproof; rather it is a judgment 
that the correct formalization of the intuitive notion of decidability is by 
means of representability in consistent and finitely axiomatizable theories. 


Definition A relation R on the natural numbers is recursive iff it is rep- 
resentable in some consistent finitely axiomatizable theory (in a language 
with 0 and S). 


Church’s thesis now can be put more succinctly: A relation is decidable 
iff it is recursive. Or perhaps more accurately: The notion of recursiveness 
is the correct precise counterpart to the intuitive notion of decidability. 
The situation is analogous to one encountered in calculus. An intuitively 
continuous function (defined on an interval) is one whose graph you can 
draw without lifting your pencil off the paper. But to prove theorems, some 
formal counterpart of this notion is needed. And so one gives the usual 
definition of e-d-continuity. One should ask if the precise notion of e-d- 
continuity is an accurate formalization of intuitive continuity, If anything, 
the class of e-d-continuous functions is too broad. It includes nowhere dif- 
ferentiable functions, whose graphs cannot be drawn without lifting the 


pencil, But accurate or not, the class of ¢-6-continuous functions has been 


found to be a natural and important class in mathematical analysis. 
Very much the same situation occurs with recursiveness. One should 


“~ ask if the precise notion of recursiveness is an accurate formalization of 


the intuitive notion of decidability. Again, the precisely defined class (of 
recursive relations) appears to be, if anything, too broad. It includes rela- 
tions for which any decision procedure will, for large inputs, require so 
much computing time and memory (‘“scratchpad”) space as to make im- 
plementation absurd. Recursiveness corresponds to decidability in an ide- 
alized world, where length of computation and amount of memory are 
disregarded. But in any case, the class of recursive relations has been found 
to be a natural and imporant class in mathematical logic. 

Empirical evidence that the class of recursive relations is not too narrow 
is provided by the following: 


1. Any relation considered thus far which mathematicians have felt was 
decidable has been found to be recursive. 
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2. Several people have tried giving precise definitions of idealized com- 
puting agents (e.g., Turing machines, introduced by Turing in 1936, or 
the register machines described in Section 3.8.) The idea was to devise 
something that could carry out any effective procedure. In all cases, the 
class of relations having decision procedures executable by such a comput- 
ing agent has been exactly the class of recursive relations. 


The fact that so many different (yet equivalent) definitions for the class 
of recursive relations have been found is some indication of the naturalness 
and importance of the concept. 

In this book we will continue to exclude the intuitive notion of decid- 
ability from nonstarred theorems. But in the remainder of the exposition 
we will accept Church’s thesis. For example, we will speak of a set’s being 
undecidable when we have a theorem stating it to be nonrecursive. 


Obviously any relation representable in Cn A, is recursive. We will prove | 


later that the converse also holds; if a relation is representable in any con- 
sistent finitely axiomatizable theory, then it is representable in the one theory 
we have selected for special study. (This was, of course, a motivating factor 
in our selection.) 

The functional analog of a decidable relation is a computable function. 


*Definition A function f : N*— N is computable iff there is an effective 
procedure which, given any k-tuple dof natural numbers, will produce /(@). 


For example, addition and multiplication are computable. Effective 
procedures for these functions are taught in the elementary schools. On 
the other hand, of the uncountably many functions from N* into N, only 
countably many can be computable. 

We want to give a mathematical counterpart to the intuitive concept 
of computability, just as in the case of decidable relations. The clue to 
the correct counterpart is provided by the next theorem. Recall that any 
function f: Nt -> N is-also a (k + 1)-ary relation on WN: 


ay, vive ak, bD) Ef f(a, ss Ay) = D. 


At one time it was popular to distinguish between the function and the 
relation (which was called the graph of the function). Current set-theoretic 
usage takes a function to be the same thing as its graph. But we still have 
the two ways of looking at the function. 


*Theorem 33H The following three conditions on a function f : Në > N 
are equivalent: 


' 
J 
i 
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(a) f is computable. 
(b) When viewed as a relation, f is a decidable relation. 
(c) When viewed as a relation, f is an effectively enumerable relation. 


Proof (a) = (b): Assume that fis computable; we will describe the deci- 
sion procedure, Given <a,, ..., ap, D>, first compute f (az, ..., a). Then 
look to see if the result is equal to b. If it is, say “yes”, otherwise say “no.” 

(b) = (c): Any decidable relation is effectively enumerable. For we can 
enumerate the set of all (k + 1)-tuples of numbers, and place on the output 
list those which meet the test of belonging to the relation. 

(c) = (a): Assume that we have an effective enumeration of (the graph 
of) f. To compute f(a,,..., ap) we examine the (k + 1)-tuples in the 
enumeration until we find the one that begins with a4, ..., ap. Its last 
component is then the desired function value. B 


Thus by using Church’s thesis, we can say that f'is computable iff f (viewed 
as a relation) is recursive. The class of recursive functions is an interesting . 
class even apart from its connection with incompleteness theorems of logic. 
It represents an upper bound to the class of functions which can actually 
be computed by programs for digital computers. The recursive functions 
are those which are calculable by digital computers, provided one ignores 
practical limitations on computing time and memory space. 

We can now describe our plans for this section and the next. Our basic 
goal is to obtain the theorems of Section 3.5, But some groundwork is 
required before we can prove those theorems; we must verify that a number 
of relations (intuitively decidable) and a number of functions (intuitively 
computable) are representable in Cn Ag and hence are recursive. In the 
process we will show (Theorem 34A) that recursiveness is equivalent to 
representability in Cn Ay. In the remainder of the present section we will 
establish general facts about representability, and will show, for example, 
that certain functions for encoding finite sequences of numbers into single 
numbers are representable. Then in Section 3.4 we apply these results to 
particular relations and functions related to the syntactical features of the 
formal language. 

The author is sufficiently realistic to know that many readers will be more 
interested in the theorems of Section 3.5 than in the preliminary spadework. 
If the reader is willing to believe that intuitively decidable relations are all 
representable in Cn Ag, and intuitively computable functions are function- 
ally representable (a concept we will define shortly) there, then most if 
not all of the preofs in this spadework become unnecessary. But it is hoped 
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that the definitions and the statements of the results will still receive some 


attention. Case 2: Otherwise for every c less than a, 


(3) Ay + ~9(S0, S20, S°0). 


Numeralwise determined formulas 
(This case occurs iff Vx(x < SO —> g(x, 5°, St0)) is true in R.) We 


Theorem 33E tells us that we can show a relation to be representable know from Lemma 33A that 


in Cn Ay by finding a formula which’ defines it in N and is numeralwise | 
determined by Ag. The next theorem will be useful in establishing numeral- (4) Ap H Wx(x < S20 > x = SW v +++ vx a S410). 
wise determination. 
The sentence in (4) together with the sentences in (3) (for c = 0, ..., a — 1) 
Theorem 331 (a) Any atomic formula is numeralwise determined by Ag. imply > fs 


(b) If p and y are numeralwise determined by Ay, then so are ~p and Wx(x < S0 — —y(x, S70, S°0)). 


p => p. 
(c) If p is numeralwise determined by Apg, then so are the following for- And this is equivalent to 
mulas (obtained from @ by “b ification”): 
( om y by “bounded quantification”): | AAx(x < 8° A p(x, SO, S%)). 
Vx(x < y = p), | 7 i ; ; 
: This shows that Ix(x < y A g(x, y, z)) is numeralwise determined by 
x(x < y AQ). An. By applying this result to ~p we obtain the fact that the dual formu- 
p 
la, Vx(x < y = p(x, y, 2)), is numeralwise determined by A, as well. ial 


Proof of (c) We will consider a formula | 
The argument in case 2 relied on the fact that the x quantifier was bounded 


dx(x < y A glx, y, Z 
: LO) by S20. We will see that it is possible for 


in which just the variables y and z occur free. Consider two natural numbers 
ree, ers 3 0 i 
a and b; we must show that either WS), VSD) 500 


Ag + Ax(x < S0 A p(x, SO, S°0)) | all to be consequences of Ay without having 


or Wx (x) 


The preceding theorem is a useful tool for showing many relations to 


Ag H mAx(x < S0 a plx, S10, S°0)). be a consequence. 
: be representable in Cn Ag. For example, the set of primes is represented by 


Case 1: For some c less than a, 
(1) _ , 4e F 89, 5%0, 870). | S10 <9 A Vafa < vy > VY <-> x + y AE v)). 


(This case occurs iff Ix(x < S%0 A p(x, S70, S0)) is true in N.) We also 
have 


(2) Ag | S0 < S% 


This formula defines the primes in N, and by the preceding theorem is nu- 
meralwise determined by Ay. It therefore represents the set of primes in 
Cn Ág. i l 


by (a). And the sentences in (1) and (2) imply the sentence : Representable functions 


Often it is more convenient to work with functions than with relations. 


Fx(x < S70 a g(x, SO, S0)). 
i Let f : N” + N be an m-place function on the natural numbers. A for- 
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mula p in which only »,,...,%, 4, occur free will be said to functionally 
represent f (in the theory Cn Ar) iff for any a,,...,d, in N, 


Ar Wi mslp(S0, «2. , S0, Vyr) > Vg = SMG edQ], 


(Observe that the “<—” half of this sentence is equivalent to (S%0, ... , S%0, 
S/1,..4m9), The “-»” half adds an assertion of uniqueness.) 


Theorem 333 If pọ functionally represents f in Cn Ag, then it also 
represents f (as a relation) in Cn Ag. 


Proof, with m = 1 Since g functionally represents f, we have for any b: 
Ay | 9(S70, S°0) +> S°0 ~ S490, 


If <a, b> € f, ie., if f(a) = b, then the right half of this biconditional is 
valid and we get 


An | 9(S%, 80). 


But otherwise the right half is refutable from Ap (i.e., its negation is deduc- 
ible), whence 


Ar H “g(s, S20). @ 
The converse of this theorem is false. But we can change the formula: 
Theorem 33K Let f be a function on N which is (as a relation) repre- 
sentable in Cn Ay. Then we can find a formula g which functionally rep- 


resents fin Cn Ap. 


Proof To simplify the notation we will take fto bea one-place function 
on N. The desired sentence, 


Vv.[9(S20, v2) 4 v ~ SMO], 


is equivalent to the conjunction of the two sentences 


(1) 9(S*0, S10) 
and 
(2) - V va[@(S0, vs) —> v, ~ SO], 


The sentence (1) is a theorem of Ag whenever ¢ represents f. The sentence 


i 
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(2) is an assertion of uniqueness; we must construct p in such a way that - 
this will also be a theorem of Ag. . . 
Begin with a formula 6 known to represent f (as a binary relation). 


Let p be 
O(v,, va) A W2(z < v > 70%, Z)). 


We can then rewrite (2) as 

(2) Wv,[0(S%0, va) A V2(z < va > 0(S"0, z)) > v, = SMO]. 

To show this to be a theorem of Az, it clearly suffices to show that 
Ay U {0(S%, va), VZ(Z < v —> —0(S20, '2))} F 2a = S190. 


Call this set of hypotheses (to the left of “}H”) T. Since L3 € Apg it suffices 
to show that 


(3) Pho, £8! 
and 
(4) i T j SAO £ va. 


It is easy to obtain (4), since from the last member of J’ we get 


Shing < v, — 6(S0, S10) 


and we know that 
(5) Ag | (80, S10), 


To obtain (3) we first note that we have as theorems of Apr, 


(6) Vy < SIOD o> vy = S00 v <- v p = SMO-10 
and 
(7) ($0, 8°0) for b=0,..., f(a) — |. 


The formulas (6) and (7) imply the formula 
8) p, < S100 > 70S, va). 


Since 6(S20, v,) € T, we have (3). 


~ which is a quantifier-free sentence true in N. (Here t(u,, .. 
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_ This shows (2) to be a theorem of Aj; (5) and (8) show (1) to be a theorem 
of Ag as well. l “ 


We next want to show that certain basic functions are representable 
(in Cn Áp) and that the class of representable functions has certain closure 
properties. Henceforth in this section, when we say that a function or rela- 
tion is representable, we will mean that it is representable in the theory 
Cn Ag. But the phrase “in Cn Ag” will usually be omitted. 

‘In simple cases, an m-place function might be represented by an equation 


Yap S L 


In fact, any such equation, when the variables in ¢ are among %, 
defines in N an m-place function f. (The value of f at <a, ..., am? is the 
number assigned in N to ¢ when v; is assigned a;, 1 < i < m.) Furthermore, 
we know that any equation is numeralwise determined by Ag, so the 
equation represents f as a relation. In fact, it even functionally represents 
J, for the sentence 


e. Vms 


VP names = S90, ... , S00) <> Vy py a SaO] 
is logically equivalent to 


(S40, ..., S0) a2 S0100, 


. , Um) is the term 
obtained by replacing v, by u, then v, by ug, etc.) For example: 


1. The successor function is represented (functionally) by the equation, 
Va = Sv. 


2. Any constant function is representable. The m-place function which 
constantly assumes the value b is represented by the equation 


Vm = 8°0. 


3. The projection function 
(ay, tee >» Am) = h 
is represented by the equation 


Vmi Vi. 
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| 4. Addition, multiplication, and exponentiation are represented by the 
| equations 

J Va = 0) + V2, i 

i Vz = Vy ° V2, : 

| v, = 0, E v, 

respectively. 


The reader should not be misled by these simple examples; not every 

representable function is representable by an equation.. l l . 
| We next want to show that the family of representable functions 1s closed 
under composition. To simplify the notation, we will consider a one-place 
function f on N, where 


| fla) = g(h la), h(a). 


| Suppose that g is f unctionally represented by y and h; by 0;. To represent 
f it would be reasonable to try either 


Vy V y0 > OPi, Yo) > pi» Yas Y2)) 


or 
i ay DAUGE V1) A O(P, Va) A POr Va» va)). 


| (Think of pi, Y2» v) as saying “8i Yo) = v” and think of 0,(v,, yi) 
as saying “hiw) = y.” Then the first formula translates, For any yy, 
+ Pa, if ha(w) = V1 and h,(v;) = Ye, then g(r» Ya) = Ya.” The second ies 
translates, “There exist y1, Y2 such that Aw) = Ya and hv) == p an 
gO, Y2) = ve” Either one is a reasonable way of saying, “enn he(vy)) = 
= 7 » There are two choices, because when something is unique, either 
Sons 2° a . 
quantifier can be used for it.) 

Actually either formula would work; let p be 


Wy V yO 0 , J) > OVis Yo) > Wr, Yas v,)). 


Consider any natural number a; we have at our disposal 
(1) Vogp(StO0, SiO, va) <r Ve ~ Sng]. 
VWy,[0,(80, y) e yi ~ Ss*o0]. 


V y:[0(S°0, Yo) > Va © S(O]. 


Vv,(p(S%, v2) > 22 ~ S10), 
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ie, l 
(4) YoY y Yy S0, y1) > 8S0, ya) —>-y(V1, Ya, Va +> va ~= S100). 


But (1), (2), and (3) imply (4), as the reader is asked to verify in Exercise 4, 
More generally we have . 


Theorem 33L Let g be an n-place function, let 74, 


<., Ap be me-place 
functions, and let f be defined by 


f(a, +s Am) = ghia, oe -> Am)» tae hay, ee s Un) 


From formulas functionally representing g and 7, ..., A, we can find a 
formula which functionally represents f. 


In the above proof we have m = 1 and n = 2. But the general case is 
proved in exactly the same way. 


In order to obtain a function such as 
IG b) = g(A(a), b), 
we note that 
f(a, b) = gha, 6)), Ta, b)). 


The above theorem then can be applied (twice) to show that fis represent- 
able (provided that g and A are). 


. To facilitate discussion of functions with an arbitrary number of vari- 


ables, we will use vector notation. For example, the equation in the above 
theorem can be written 


sä) = elh (a), ere’ h,(@)). 


Another important closure property of the functions representable in 
Cn Ap is closure under the “least-zero” operator. l 


Theorem 33M , Assume that the (m + 1)-place function g is represent- 
able and that for every a,, ..., a, there is a b such that l 


gla, sesam b) = 0. 
Then we can find a formula which represents the m-place function f, where 


flai, «++, am) = the least b such that gla, . R A b) =0. 


| 
| 
| 
| 
| 
| 
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(In vector notation we can rewrite this last equation: 
f(@) = the least b such that g(d, b) = 0. 
The traditional notation for the least-zero operator is 
f(a) = ubig(ā, b) = 0).) 
Proof To simplify the notation we take m = 1; thus 
f@=6b 


If represents g, then we can obtain a formula representing J (as a relation) 
simply by formalizing the right side of this equivalence: 


iff g(a, b) = Ô and for all c < b, g(a,c) 40. 


pr, Va, 0) A Vy < V => ~ho >Y, 0)). 


This formula defines (the graph of) f and is numeralwise determined by 
Ay. E 


A catalog 


We now construct a repertoire of representable (in Cn An) functions and 
relations, ‘including functions for encoding and decoding sequences. 


0. As a consequence of Theorem 33I any relation which has (in Ñ) a 


- quantifier-free definition is representable. And the class of representable 


relations is closed under unions, intersections, and complements. And if 
R is representable, then so are 


{lais ++ +5 Am, DX : for all c <b, (Oy, -yam C E R} 


and 


{lays +++) Im, b): for some c < b, (ay, 3 Am > ERY 


For example, any finite relation has a quantifier-free definition, as does 
the ordering relation. 


1. A relation R is representable iff its characteristic function Kpr is. (Kr ; 
is the function for which Kg(2) = 1 when de R, and K,(@) = 0 otherwise.) 


Proof (<=) Say that R is a unary relation (a subset of N) and that 
Kp is represented by y(i, va). We claim that y(v,, S0} represents R. For 
it defines R and is numeralwise determined by Az. 
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(=>) Say that p(v,) represents R. Then 
(plv) A va = S0) V (pli) A v =~ 0) 


represents (the graph of) Kr, for the same reason as in the last paragraph. 
(Actually this formula even functionally represents Kp, as the reader can 
verify.) a 


2. If R is a representable binary relation and .f, g are representable 
functions, then 


{@: <f(@), e) € R} 
is representable. Similarly for an im-ary relation R and functions f, Sod E 


Proof Its characteristic function at @ has the value K(f (@), 2(@)). 
Thus it is obtained from representable functions by composition. ] 


For example, suppose that R is a representable ternary relation. Then 
{x y> XY, x, x> € R} 
is representable, being 
{x y> : (18, y), T, y), Li, y)> € R} 


In this way we can rearrange and repeat variables in describing a repre- 
sentable relation. 


3. If R is a representable binary relation, then so is 
{<a, b> : for some c < b, <a, c> € R}. 
Proof We have from catalog item O that if 
Q = {<a, b> : for some c < b, <a, c) € R}, 
then Q is representable. And 


<a, b> € R < <a, S(b) € Q 
<> (I%(a, b), S(I3a, b))> € Q. 


Hence by catalog item 2, R is representable. 
More generally, if R is a representable (m + 1)-ary relation, then 


{<ar +++ Am, b> : for some c < b, ay, ..., dy, > € R} 


| 
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is also representable. In vector notation this relation becomes 
{<d, b> : for some c < b, <d, c> € R} 


Similarly, 
{<d, b> : for all c < b, (a, &> € R} 


is representable. 
-` 4, The divisibility relation 

{<a, b> : a divides b in N} 
is representable. 


Proof a divides b iff for some q < b, a : q = b. We know that {<a, b, 
qy :a - q = b} is representable, as it has a quantifier-free definition. Upon 
applying the above items, we get the divisibility relation. (In yet further 
detail, from catalog item 3 we get the representability of 


R = {<a, b, c> : for some q <c, a q = b} 
and a divides b iff <a, b, b> € R.) @ 
5. The set of primes is representable. 
6. The set of pairs of adjacent primes is representable, 


Proof <a, b> is a pair of adjacent primes iff a is prime and b is prime and 
a < b and there does not exist any c < b such that a < ¢ and c is prime. 
The right side of this equivalence is easily formalized by a numeralwise 
determined formula. 


Note (for future use in Section 3.7) that we have not yet used the fact 
that exponentiation is representable. 


7. The function whose value at a is pa, the (a + 1)st prime, is represent- 
able. (Thus po = 2, Pi = 3, P2 = 5, Ps = 7, pa = 11, etc.) 


Proof pa= b iff b is prime and there exists some c < b% such that 


(i) 2 does not divide c. 

(ii) For any q < b and any r < b, if <q, r> isa pair of adjacent primes, 
then for all j < c, . 
g divides c <> ri divides c. 


(iii) be divides c and b** does not. 
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This equivalence is not obvious, but at least the relation defined by the 
right-hand side is representable. To verify the equivalence, first note that 
if Ppa = b, then we can take 

c= 2°. 31 » 52. t.. o pe, 
It is easy to check that this value of c meets all the conditions, Conversely, 
suppose c is a number meeting conditions (i)-(iii), We claim that c must 
be i 
20. 31. «++ + be. powers of larger primes. 


Certainly the exponent of 2 in c is 0, by (i). We can use (ii) to work our 
way across to the prime b. But by (iii) the exponent of b is a, so b must. be 
the (a + 1)st prime. z 


This function will be very useful in encoding finite sequences of numbers 
into single numbers, Let 


Caos ey Ap) = PET s ves + pept 
= [] pm. 
ism : 
This holds also for m = —1; we define «>= 1. For example, 


(2, 1) = 28 . 32 = 72, 
The idea is that 72 safely encodes the pair <2, 1>. 


8. For each m, the function whose value at Ay, +. 


ae an is Cao, ii Am? 
is representable. 


9. There is a representable function (whose value at <a, b> is written: 
(a),) such that for b < m, 
(<a, e...) Amdo = ay. 
(This is our: “decoding” function.) 


Proof Let (a), be the least n such that either a = 0 or pi does not 
divide a. (There always is such an n.) Observe that (0), = 0, and for a Æ 0, 
(a), is one less than the exponent of p, in the prime factorization of a (but 
not less than 0). Hence for b < m, 


(Kao, baoa »4m))o = ap. 


` fined by 
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To prove representability we use the least-zero operator. Let 
R = {<a, b, n> :a=0 or pet does not divide a}. 
Then (a), = pa[Kx(a, b, n) = 0], where R is the complement of R, E 


Since the method used in the above. proof will be useful elsewhere as 
well, we here state it separately: 


Theorem 33N Assume that R is a representable relation such that 
for every g there is some n such that <a, n> € R. Then the function f de- 


f@) = the least n such that <d,n) € R 
is representable. 

Proof f(@) = pal KxG, n) = 0}. a 

We will later use the notation 
S(@) = prl<G, ny € R). 


10. Say that b is a sequence number iff for some m => —1 and some 


Foy s+ Ans 


b= (a; . any 


(When m = —1 we get <) = 1.) Then the set of sequence numbers is 
representable. 


Proof Exercise 5, 

11. There is a Sesceentabie function lh such that 
Ih{ay, --.5@m) = m+ 1. 

(“ih” stands for “length.’’) 


Proof Let lh a be the least n such that either a = 0 or p, does not divide 
a. This works. E i 


12, There is a representable function (whose value at <a, b> is called 
the restriction of a to b, written: a [ b) such that for any b<im-+ 1, 


Clos +++ 1 Am) | b = Coy ++ o-i) 
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Proof Let a} b be the least n such that either a = 0 or both n 0 
and for any j <b, any k <a l 
# divides a = pł divides n. 


This works. 


13. (Primitive recursion) With a (k + 1)-place function f we associate 
another function f such that f(a, bi, ..., by) encodes the values of f(j, 
bi, ...,b,) for all j <a. Specifically, let 


fla, b) = (£0, D, ..., f(a — 1, 8)». 


For example, fo, b) = = ( X = 1, encoding the first zero values of f. fa, D) 
= < f(0, By). In any case, Ña, b) is a sequence number of length a, encod- 
ing the first a values of f. 
Now suppose we are given a (k + 2)-place function g. There exists a 
unique function f satisfying 
J(e, D) = g(f(a, d), a, B. 
For example, 
SO, b) = g(¢ >, 0, b), 
SA, D = 8KO, D), 1, D). 
(The existence and uniqueness of this f should be intuitively clear. For 4 
proof, we can apply the recursion theorem of Section 1.2, obtaining first J 
and then extracting f.) 


_ Theorem 33P Let g be a (k + 2)-place function and let f be the unique 
(k+ 1)-place function such that for all a and (k-tuples) b, 


Sa, D = gf, D, a, b). 
If g is representable, then so is f. 


Proof First we claim that f is representable. This follows from the fact 
that 


f(a, b) 


== the least s such that s is a sequence number of length 2 
and for i less than a, (s); = g(s | i, a, D). 


It then follows that fi is representable, since 
f(a, b) = e/a, Ù, a, D 


and the functions on the right are representable. 


i SU ee | 
aa 
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Actually the phrase “primitive ens is more commonly applied 
to a simpler version of this, given in Exercise 8. 


14, For a representable function F, the function whose value at a, $ is 


TI rG, 4) 


i<a 
is also representable. Similarly with 2 in place of I. 


Proof Call this function G; then 


GO, D = 1, 
G(a + 1, b) = Fla, b) + Ga, D). 


Apply Exercise 8. E 
15. Define the concatenation of a and b, a b, by 
asb=a: TI afte 
This is a representable function of a and b, and 
lais o.e Ag) * Dis «+5 On = CA» - bi). 


The concatenation operation has the further property of being associative 
on sequence numbers. 


E ER ES 


16, We will also want a “capital asterisk” operation. Let 
kica F = fO) SA) +++ efa 1). 


For a representable function F, the function whose value at a, bis Kica Mi, b) 
is representable. 


Proof Kico FG, b) =<) = 1. 
K icat FG, D) = 


So this is just like catalog item 14. 


Kica FG, b) * F(a, D). 


EXERCISES 


“4. Show that in the structure (N, +, E) we can define {0}, <, the 
successor relation {<n, S(n)> :n € N}, and the addition relation {Km, n, 


m+n: m nin N} 


216 3. UNDECIDABILITY 


2. Prove Theorem 33C, stating that true (in Ñ) quantifier-free sentences 
are theorems of Ag. 


3. A theory T (in a language with 0 and S) is called w-complete iff for 
any formula g and variable x, if yng belongs to T for every natural number 
n, then Vx p belongs to T. Show that if T is a consistent w- complete theory 
in the language of N and if Ap S T, then T= Th N. 


4. Show that in the proof preceding Theorem 33L, formula (4) is 
logically implied by the set consisting of formulas (1), (2), and (3). 


5. Show that the set of sequence numbers is representable (catalog 
item 10). 


6. Is 3 a sequence number? What is lh 3? Find (1 « 3) + 6 and 1 x (3 * 6), 


7. Establish the following facts: 


(a) a+ 1 <p,. 

(b) (6), <b; equality holds iff b = 0. 

(c) lha <a; equality holds iff a = 0. 
(dd) afi<a. 

(e) Ih(a[ i) is the smaller of i and 1h a. 


8. Let g and h be representable functions, and assume that 
S(O, b) = gb), 
f(a + 1, b) = h(fG, b), a, b). 
Show that f is representable. 


9, Show that there is a representable function f such that for any i, 
Qo, ae a ay % 


NAC 


10. Assume that R is a representable relation and that g and h are 
representable functions, Show that f is representable, where 
n {e@) if ZER, 
a = 
I(@) ne if aoe 


> an?) = Ay. 


11. (Monotone recursion) Assume that R is a representable binary re- 
lation on N. Let C be the smallest subset of N (i.e., the intersection of all 
subsets) such that for all n; ag, ..., apa1; b, 


<lao, +665 an1) b> E€ R & a; € C (for all i <n) >bEC. 
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Further assume that (1) for all n, a), ... 


(doy ++ 


> n-i b, 
an1) D> E R=aq; <b for all i < n, 
and (2) there is a representable function f such that for all n, ao, .. 


<<a , 


<s n1 Ds 
-> ln) b E R=n < f(b). 


Show that C is representable. (C is, in a sense, generated by R. C54 Ø in 


. general because if << Y, b> € R, then be C.) 


§3.4 ARITHMETIZATION OF SYNTAX 


In this section we intend to develop two themes: 


1. Certain assertions about wffs can be converted into assertions about 
natural numbers (by assigning numbers to expressions). 

2. These (English) assertions about natural numbers can in some cases - 
be translated into the formal language. And the theory Cn A, is strong 
enough to prove some of the translations so obtained. 


This will give us the ability to construct formulas which, by expressing 
facts about numbers, indirectly express facts about formulas (even about . 
themselves). Such an ability will be exploited in Section 3.5 to obtain results 
of undefinability and undecidability. 


Gédel numbers 


We first want to assign numbers to expressions of the formal language. 
Recall that the symbols of our language are those listed in Table IX. 


TABLE IX . 
Parameters Logical symbols | 
OV 1. ( 
2. 0 3.) 
4,8 5. 
6. < i 
8. + 9. & 
10, > li. », 
12. E 13. v, etc 
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There is a function A assigning to each symbol the integer listed to its left 
Thus A(Y) = 0, A(0) = 2, and Aw) = 9 + 2i. In order to make our subse. 
quent work more widely applicable, we will assume only that we have some 
language with 0 and S which is recursively numbered. By this we mean that 
we have a one-to-one function + from the parameters of that language int 

the even numbers such that the two relations : 


{<k, my : k is the value of h at some m-place predicate symbol} 
and 


{<k, m : k is the value of h at some m-place function symbol} 


are both representable in Cn Apg. (Of course in the case of the language 

of - these sets are finite.) We define A on the logical symbols as before. 
or an expression ¢ = sg+-+-s, of the language we define i Bi 

eto, x guage we define its Gödel 


HS. t tt Sn) = Aso), ..- 5 ACS,)). 
For example, using our original function A for the language of N, we obtain 
(Av, va = 0) 
= H(-Vo,(-~»,0))) 


= 2? 38. 51. 716. 112. 139. 1710. 1918 . 233 . 294. 344, 


This is a large number, being of the order of 1.3 x 10%, To a set Ø of 
expressions we assign the set 


EO = (He) : © € D) 
of Gédel numbers. 


To a sequence <a, .. 
the number 


. »@,» of expressions (such as a deduction), we assign 


Faos one s An?) = CHa , n : 


» an). 

We now proceed to show that various relations and functions having 
to do with Godel numbers are representable in Cn Ay (and hence are re- 
cursive). As in the preceding section, whenever we say that a relation of 
function is representable (without specifying a theory) we mean that it i$ 
representable in the theory Cn Ag. 

We will make use of certain abbreviations in the language we use (i.e 
f English, although it is coming to differ more and more from what onë 


‘slice RON LUNN 


i aati 


l 
| 
| 
| 
| 
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ordinarily thinks of as English). For “there is a number a such that” we 
write “Ja.” In the same spirit, “da, b < c” means “there are numbers a 
and b both of which are less than c such that.” Similarly, we may employ 
«y,» We would not have dared to employ such abbreviations in Chapter 2, 
for fear of creating confusion between the formal language and the meta- 
language (English). But by now we trust the reader to avoid such erroneous 


ways. 


1. The set of Gödel numbers of variables is representable. 


Proof It is {a : (Eb < aja = (11 + 2b)}. It follows from results of 
the preceding section that this is a representable set. 


2. The set of Gödel numbers of terms is representable. 


Proof The set of terms was defined inductively. And terms were built 
up from constituents with smaller Gödel numbers. We will treat this case 
in some detail, since it is typical of the argument used for inductively de- 
fined relations. 

Let f be the characteristic function of the set of Gödel numbers of terms. 
From the definition of “term” we obtain 


1 if a is the Gédel number of a variable, 

i if (Ji, k < a) [i is a sequence number 
& (Vj <Ih)S(D,) = 1 & k is the value of 
h at some (lh i)-place function symbol & 
a = liy * ®jcni)s> 

0 otherwise. 


sQ = 


Although the right side of this equation refers to f, it refers only to f((i),), 
where (i); < a. This feature permits us to apply primitive recursion. f(a) 
= g(f(a), a), where 
1 if a is the. Gödel number of a variable, 
1 if (Ji, k <a@)[i is a sequence number 
& (Vj <Ihil(s)jy, = 1 & k is the 
value of h at some (Ih /)-place function 
symbol & a = <k) * Kjemi)» 
0 otherwise. 


als, a) = 


For if in this equation we set s equal to f(a), then (8), = S(O) for Oj < a. 
Hence by Theorem 33P, f is representable provided that g is. 
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It remains to show that g is representable. But this is straightforward, 
by using results of the preceding section. Briefly, the graph of g is the union 
of three relations, corresponding to the three clauses in the above equation, 
Each of the three is obtained from equality and other representable relations 


by bounded quantification and the substitution of representable func- 
tions. a 


3. The set of Gödel numbers of atomic formulas is representable, 


Proof a is the Gödel number of an atomic formula iff (di, k < afi is 
a sequence number & (Vj < Ih i)À(i); is the Gödel number of a term & k 
is the value of h at some (h i)-place predicate symbol & a= «k) 
t Kjemi). a 


4, The set of Gödel numbers of wffs is representable, 


Proof The wffs were inductively defined. Let f be the characteristic func- 
tion of the set, then 


1 if a is the Gödel number of an atomic formula, 
1 if (di < ola = A O, AY ie CAC) )) 
& f(i) = 1], 


1 if (Gij < ala = ChC()) * i» CA) s je CAC) 
& fli) = fj) = 1, 

1 if (di, j <a@la = <A(Y)y «ix j&i is the Gödel 
number of a variable and /(j) = 1], 

0 otherwise. 


fla) = 


By the same argument, used for the set of Gödel numbers of terms, we 
have the representability of f. a 


5. There is a representable function Sb such that for a term or formula 
a, variable x, and term ż, 


Solle, thx, He) = tog. 


Proof We will define Sb to be the unique function such that Sb(a, b, ¢) 
= d iff one of the following holds: 


(i) a is the Gödel number of a variable and a = b and 
d=c, 


(ii) i,k <a)fi is a sequence number & (Vj < 1h i(i); is the Gödel 


l 
| 
| 
| 
| 
| 
| 
| 
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| 
| 
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number of a term & k is the value of A at some (lh i)-place function or 
predicate symbol and a = {k} * %;<n:(é); and 

d= <k} * Kjethi Sb(();, b, c)]. 

(iii) (Ai < a){i is the Gödel number of a wi &a= ¢h((), =) +i 


» <hC))> and 
d = ¢h( (), A(>)) * SbG, b, c) * <hO ))). 


(iv) Gi, j < a)i and j are Gödel numbers of wffs & a = ¢h(()) xix 
h>) * j= ChC))> and 
d = hC ())  SbG, b, c) * <A(>)) * Sb, b, €) = CAC) DX]. 


(v). (li, j < a)[i is the Gödel number of a variable & iA b & j is the 
Gödel number of a wf & a = <A(Y)y»i»j and 


d = (h(W)) «i Sb(j, b, o)l. 
(vi) None of the above conditions on g and b are met (where,we ignore 
the displayed equation for d) and d = a. 


Observe that the above does uniquely determine the value of Sb(a, b, c). 
No two clauses could apply to one number a. And if, for example, clause 
(ii) applies to a, then we know from Section 2.3 that the numbers i and k 


are unique. -d 
Finally, we apply the argument used for the set of Gödel numbers of 
terms, viewing Sb as a 4-ary relation. 


6. The function whose value at n is 4(S"0) is representable. 
Proof Call this function f, then 


SO) = h)), 
f(a + 1) = CAS) *f@). 


Apply Exercise 8 of the preceding section. — 


7. There is a representable relation Fr such that for a term or formula 
œ and a variable x, 


<a, tx> € Fr <> x occurs free in æ. 


Proof <a, b> € Fr <> Sb(a, b, #9) Æa. a 
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8. The set of Gödel numbers of sentences is representable. 


Proof ais the Gödel of a sentence iff a is the Gödel number of a formula 
and for any b < a, if b is the Gödel number of variable then <a, b) ¢ Fr. m 


9, There is a representable relation Sbl such that for a formula œ, variable 
x, and term ¢, <Ha, tx, #¢> € Sbl iff ¢ is substitutable for x in a. 


Proof Exercise 1. 


10. The relation Gen, where <a, b> e Gen iff a is the Gödel number of 
a formula and b is the Gödel number of a generalization of that formula, is 
representable. 


Proof <a, by e Gen iff a = b or (Si, j < b)fi is the Gödel number of 
a variable and <a, j> € Gen and b = <h(W)) «ix j]. Apply the usual ar- 


' gument to the characteristic function of Gen. 2 


11. The set of Gödel numbers of tautologies is representable. 


The set of tautologies is intuitively decidable since we can use the method 
of truth tables. To obtain representability, we recast truth tables in terms 
of Gödel numbers. There are several preliminary steps: 


11.1 The relation R, such that <a, b> € R iff a is the Gödel number of 
a formula a and b is the Gödel number of a prime constituent of a, is 
representable, © 


Proof <a, by € Ra is the Gödel number of a formula and one of 
(i) a = b & a) E (O. 

(ii) Gi < ala = Ch (), A=) =i» <h())) and <, bX € R]. 

Gii) The analog to (ii) but with —. 


Apply the usual argument to the characteristic function of R. a 


11.2 There is a representable function P such that for a formula g, 
Pa) = CHB, ...,48,), the list of Gödel numbers of prime constituents 
of a, in numerical order. l 


Proof First define a function g for locating the next prime constituent. - 


in ha after hy (where ka is the formula æ for which a = ta). 


g(a, y) = the least n such that either n =a + 1 or both 
y<nand (ane R 


T si tai iy 
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Next define a function h such that h(a, n) lists the first n prime constituents 


of ha (if there are that many): 


f if g(a, 0) > a, 
h(a, 0) Eri { (g(a, 0)> otherwise. 
fha, n) , if gla, t) > 4, 
h(a, n + 1) = { h(a, n)» (g(a, t)> otherwise. 


Hére ż is the last component of h(a, n), obtained from h(a, n) by the repre- 
sentable function of Exercise 9 of the preceding section. Finally, let P(@@) = 
h(a, a). El 


11.3 Say that the integer v encodes a truth assignment for a iff v is a 
sequence number and lhv = lh P(o) and (Vi <lhv)Ge < 2)v); = 
(PR), e). This is a representable condition on v and $a. 

For example, if P(x) = <#Bo, ...,#6,>, then 


p= CCHBo» Coys sees CHB ns en) 


where each e; is 0 or 1. We will later need an upper bound for v in terms 
of ita. The largest v is obtained when each e; is 1. Also #8; < #œ, so that 


v < (CHa, 1), ..., a, 1X) 
= Kich Puja H, 22 


11.4 There is a representable relation Tr such that for a formula œ and 
a v which encodes a truth assignment for œ (or more), <H, v> e Tr iff that 
truth assignment satisfies æ. 


Proof Exercise 2. 


Finally, œ is a tautology iff œ is a formula and for every v encoding a 
truth assignment for «, <Ha, vy e Tr. The (English) quantifier on v can be 
bounded by a representable function of ta, as’ explained in 11.3. 


12. The set of Gödel numbers of formulas of the form Vx p —> yf, where 


‘tis a term substitutable for the variable x in p, is representable. 


=~ Proof « is of this form iff (3 wi p <a)(3 variable x < &) (A term 
t <a)[t is substitutable for x in p and a= Vx g — gj]. Here “p <a” 
means that itp < ite. This is easily rewritten in terms of Gödel numbers: 
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a belongs to the set iff (Af < aax < a)(Gt < a)i f is the Gödel number of a 
formula & x is the Gödel number of a variable & t is the Gödel number 
of term and <f, x, 1) € SblL& 


a = Ch (), A(W)) * x * f* CAC>)) * SEF, x, 1) * KACO DY. 


13. The set of Gödel numbers of formulas of the form Vx(a— f) + Vx a 
=> Vx B is representable. í 


Proof y is of this form iff (3 variable x < y) (3 formulas a, B < y)[y 
= Vx(a -> B) > Yxa — Vx f]. This is easily rewritten in terms of Gédel 
numbers, as in 12. 


14. The set of Gödel numbers of formulas of the form æ -» Vx æ, where 
x does not occur free in œ, is representable. 


Proof Similar to 13. | 


15. The set of Gödel numbers of formulas of the form x == x is represen- 
table. 


Proof Similar to 13. @ 


16. The set of Gödel numbers of formulas of the form x = y —>@ => a', 
where « is atomic and a’ is obtained from œ by replacing x at zero or more 
places by y, is representable. j 


Proof This is similar to 13, except for the relation of “partial substitu- 
tion.” Let <a, b, x, yò € Psb iff x and y are Gödel numbers of variables, a 
is the Gödel number of an atomic formula, b is a sequence number of 
length lh a, and for all j < 1h a, either (a); = (b); or (a); = x and (b); = y. 
This relation is representable. 


17. The set of Gödel numbers of logical axioms is representable. 


Proof œ is a logical axiom iff 38 <a such that œ is a generalization of 
B and £ is in one of the sets in items 11-16. ia 


18. For a finite set A of formulas, 
{F(D) : D is a deduction from A} 


is representable. 


3.4 Arithmetization of Syntax 225 


Proof A number d belongs to this set iff d is a sequence number of 
positive length and for every i less than lh d, either 


1. (d); € #4, 


2. (d) is the Gödel number of a logical axiom, or 
3. AJ, k <i) KOy = A OY * Mp * A) * @; * AC) YN. 


This is representable whenever #4 is, as is certainly the case for 
finite A. E 


19. Any recursive relation is representable in Cn Apg. 


Proof Recall that the relation R is recursive iff there is some finite con- 
sistent set A of sentences such that some formula ọ represents R in Cn A. 
(There is no loss of generality in assuming that the language has only fi- 
nitely many parameters: those in the finite set A, those in ọ, and 0, S, and 
V.) In the case of a unary relation R, we have that a € R iff the least D 
which is a deduction from A of either e(S*0) or ~e(S“0) is, in fact, a de- 
duction of the former. 

More formally, a € R iff the last component of f(a) is #e(S*0), where 


f(a) = the least d such that d is in the set of item 18 and the last 
component of d is either #o(S“0) or #-e(S*0). 


For this (fixed) 9, there always is such a d. a 
Since the converse to item 19 is immediate, we have 


‘Theorem 34A A relation is recursive iff it is representable in the theory 
Cn Ax. ' 


Henceforth we will usually use the word “recursive” in preference to 
“representable.” 


Corollary 34B Any recursive relation is definable in K. 


20. Now suppose we have a set A of sentences such that #A is recursive. 
Then #Cn 4 need not be recursive (as we will show in the next section). 
But we do have a way of defining Cn 4 from A: 


aétCnA iff Jdid is the number of a deduction from A 
and the last component of d is a and a is the 


Gédel number of a sentence] 


226 3. UNDECIDABILITY 


The part in square brackets is recursive, by the proof to item 18. But we. 
cannot in general put any bound on the number d. The best we can:say. 
is that #Cn A is the domain of a recursive relation (or, as we will say later, : 


is recursively enumerable). 


21, If #4 is recursive and Cn A is a complete theory, then #Cn A is re» 
cursive. _ 


In other words, a complete recursively axiomatizable theory is recursive, 
This is the analog to Corollary 25G, which asserts that a complete axiom- 
atizable theory is decidable. 

The proof is essentially unchanged. Let 


g(s) = the least d such that s is not.a sentence, or d is in the set of 
item 18 and the last component of d is either s or is <h( (), 


hn) * 5 * ChC))). 


Thus g(ito) is F of the least deduction of o or (~o) from A. And s € #Cn 4 
iff s > 0 and the last component of g(s) is s. 


At this point we might reconsider the plausibility of Church’s thesis. 
Suppose that the relation R is decidable. Then there is a finite list of explicit 
instructions for the decision procedure. The procedure itself will presumably 
consist of certain atomic steps, which are then performed repeatedly. (The 
reader familiar with computer programming will know that a short pro- 
gram can still require much time for its execution, but some commands 
will be utilized over and over.) Any one atomic step is presumably very 
simple. . E 

By devices akin to Gödel numbering, we can mirror the decision proce- 
dure in the integers. The characteristic function of R can then be put in the 
form 


K,(@) = Ufthe least s such that 


(i) (s)y encodes the input Z; — 
(ii) for all positive i < ih s, (s); is obtained from (s);_, by performance 
of the applicable atomic step; 
(iii) the last component of s describes a terminal situation, at which the 
computation is completed], 


where U (the upshot function) is some simple function that extracts from 


the last component of s the answer (affirmative or negative). The recul- 


KN aN 
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siveness of R is now reduced to the recursiveness of U and of the relations 
indicated in (i), (ii), and (iii). In special cases, such as decision procedures 
provided by the register machines of Section 3.8, the recursiveness of these 
components is easily verified. It seems most improbable that any decision 
procedure will ever be regarded as, effective and yet will have components 
which are nonrecursive. For example in (ii), it seems that it ought to be 


possible to make each atomic step extremely simple, and in particular to 
make each one recursive, 


EXERCISES 
1. Supply a proof for item 9 of this section, 


2. Supply a proof for item 11.4 of this section. 


3. Use Exercise 11 of Section 3.3 to. give a new proof that the set of Gédel 
numbers of terms is representable (item 2). , 


4. Let T be a consistent recursively axiomatizable theory in a recursively 


numbered language with 0 and S. Show that any relation representable in T 
must be recursive, 


§3.5 INCOMPLETENESS AND UNDECIDABILITY 


In this section we reap the rewards of our work in Sections 3.3 and 3.4, 
We have assigned Gödel numbers to expressions, and we have shown that 


- certain intuitively decidable relations on NV (related to syntactical notions 


about expressions) are representable in Cn Ap. 
Throughout this section we assume that the language in question is the 
language of N. (This affects the meaning of “Cn” and “theory.”) 


Fixed-Point Lemma Given any formula f in which only v, occurs 
free, we can find a sentence ø such that 


Agr | [o + A(S#0)], 


We can think of o as indirectly saying, “$ is true of me.” Actually o 
doesn’t say anything of course, it’s just a string of symbols. And even when 
translated into English according to the intended structure N, it then talks 
of numbers and their successors and products and so forth. It is only by 
virtue of our having associated numbers with expressions that we can think 


_ (Of ø as referring to a formula, in this case to ø itself. 
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Proof Let (4, %, v) functionally represent in Cn Ay a function whose 
value at <Ha, n> is 4(a(S"0)). (See items 5 and 6 in Section 3.4.) First consider 


the formula 


(1) KAC OTEZE va) > B(v,)1. 


(This formula has only v, free. It defines in N a set to which #æ belongs 
iff #(e(S#0)) is in the set defined by £.) Let q be the Gédel number of (1). 


Let o be 
Vv,[6(S20, S10, v3) —> B(v,)]. 


Thus o is obtained from (1) by replacing v», by S10, Notice that o does 
assert (under N) that $o is in the set defined by f. But we must check that 


(2) o «+ B(S#0) 
is a consequence of Az. We have, by our choice of 6, 
(3) Ap + Vvs{0(S0, S, v) <> va ~= SiO}. 
We can obtain (2) as follows: 
(>): It is clear (by looking at o) that 
o | (80, $0, S#0) —> B(S#0). 


And, by (3), 
Ag | (S20, S%0, S#0). 


Hence 
Ar 30 | B(S*0), 


which gives half of (2). 
(<-): The sentence in (3) implies 


p(S#0) > [V va (0(S%0, 5%, vs) > B(v3)))- 
But the part in square brackets is just o. E 


Our first application of this lemma does not concern the subtheory Cn Ap, 
and requires only the weaker fact that 


Hula <> BS*0)]. 


‘Tarski Undefinability Theorem The set #Th N is not definable in %. 
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Proof Consider any formula B (which you suspect might define #Th N). 
By the fixed-point lemma (applied to ~f) we have a sentence o such that 


Eglo +> —B(S*0)}. 
| de p did define #Th N, then o would indirectly say “I am false.”) Then 
Ex o > Fe B(S*0), 


so either o is true but not in the set 8 defines, or it is false and in that set. — 
Either way o shows that 6 cannot define #Th N. a 


The above theorem gives at once the undecidability of the theory of N: 
Corollary 35A #ThN is not recursive. 


Proof Any recursive set is (by Corollary 34B) definable in Mt. ] 


sive, then Cn A is not a complete theory. 
Thus there is no complete recursive axiomatization of Th R. 


Í 
i 
| 
| 
| 
I 
l 
| 
| 
| 
Gödel incompleteness Theorem (1931) If Ac Th® and #A is recur- 
| 
| Proof Since A S Th®, we have Cn A &.Th. If Cn A is a complete 
theory, then equality holds. But if Cn A is a complete theory, then #Cn A 
is recursive (item 21 of the preceding section). And by the above corollary, 
#Th N is not’ recursive. f 
In particular, Cn Ay is not a complete theory and so is not equal to 
Th 9. And the incompleteness would not be eliminated by the addition of 
any recursive set of true axioms. (By a recursive set of sentences we mean 
| of course a set Æ for which #2 is recursive.) 
/ We can extract more information from the proof of Gédel’s theorem. 
| Suppose we have a particular recursive A S Th Nin mind. Then by Theorem 
34A we can find a formula f which defines #Cn A in N. The sentence o 
produced by the proof to Tarski’s theorem is (as we noted there) a true sen- 
/ tence not in Cn A. This sentence asserts that to does not belong to the set 
- defined by 8, i.e., it indirectly says, “I am not a theorem of A.” Thus A ¥ o 
and of course 4 {~w as well. This way of viewing the proof is closer to 
Gödels original proof, which did not involve a detour through Tarski’s 
theorem. For that matter, Gédel’s statement of the theorem did not involve 
Th N; we have taken some liberties in the labeling of theorems. 
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Next we need a lemma which says (roughly) that one can add one new 
axiom (and hence finitely many new axioms) to a recursive theory without | 
losing the property of recursiveness. `° 


Corollary 35D Assume that #2% is recursive and 2 U Ag, is consistent. 
Then Cn 2 is not a complete theory. 


Proof A complete recursively axiomatizable theory is recursive (item 
21 of Section 3.4). But #Cn X is not recursive, by the above theorem. B 


Lemma 35B If ¢#Cn X is recursive, then #Cn(2' ; t) is recursive. 


Proof a € Cn(2 ; t) <> (r-a) € Cn X. Thus ; . 
This corollary is Gédel’s incompleteness theorem again but with truth 


a € #Cn(Z ; r) = a is the Gödel number of a sentence and ¢h( ()) in N replaced by consistency with Ay. 


x fe x Ch(—>)) » a » Ch()) is in #Cn Z. . i ; 
Church’s Theorem (1936) The set of Gödel numbers of valid sentences 


This is recursive by the results of the preceding sections, a (in the language of N) is not recursive. 


Theorem 35C (strong undecidability of Cn 4p) Let T be a theory such 


; ; : Proof In the strong undecidability of Cn Az, take T to be the smallest 
that T U A, is consistent. Then #7' is not recursive. f = i 


theory in the language, the set of valid sentences. 


i 
a 
| 
i 
1 
i 
i 
1 
i 
| 
| 
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| 
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(Notice that because throughout this section the language in question 
is the language of R, the word “theory” here means “theory in the language 
of N.”) 


The set of Gödel numbers of valid wffs is not recursive either, lest the set 
i of valid sentences be recursive. 
| This proof applies to the language of N. For a language with more pa- 
rameters, the set of valid sentences is still nonrecursive (lest its intersection 
l with the language of N be recursive). Actually it is enough for the language 


Proof Let T' be the theory Cn(T U Ap). If #T is recursive, then since 
Ap is finite we can conclude by the above lemma that #1” is also recursive. 


Suppose, then, that #T’ is recursive and so is represented in Cn Ag by | to contain at least one two-place predicate symbol. (See Corollary 36D.) 
some formula $. From the fixed-point lemma we get a sentence ø such that | On the other hand, some restriction on the language is needed. If the lan- 

| guage has Y as its only parameter (the language of equality), then the set of 

€) Ag | [o <> “B(S*0)]. -valid formulas is decidable. (See Exercise 6.) More generally, it is known 


that if the only parameters are Y and one-place predicate symbols, then the 


(Indirectly o asseris, “I am not in T’.”) set of valid formulas is decidable. 


o ¢ T' => $o ¢ HT" 


=> An | —B(S*0) Recursive enumerability 
= Arjo. by (*) l A relation on the natural numbers is sgid to be recursively enumerable 
=oET"', l iff it is of the form 


. {a : 3b <d, by € Q} 
So we get o € T'. But this, too, is untenable: i = 
with Q recursive. Recursively enumerable relations play an important 


oe T' = foe er role in logic. They constitute the formal counterpart to the effectively 


=> Ay | B(S'0) / enumerable relations (as will be explained presently). 
Pale by) Theorem 35K The following conditions on an m-ary relation R are 
Sd equivalent: l 

which contradicts the consistency of T”. a 1. R is recursively enumerable. — 


2. R is the domain of some recursive relation Q. 
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3. For some recursive (m +- 1)-ary relation Q, 


R= {<ay, eng > * Ibla, <- -sam bD) E Q}. 


4, For some recursive (m + n)-ary relation Q, 
R = {lays ey)? Ibis -b 


‘Proof By definition 1 and 3 are equivalent. Also 2 and 3 are equivalent 
by our definition (in Chapter 0) of domain and (m + 1)-tuple. Clearly 3 
implies 4. So we have only to show that 4 implies 3. This is because 


Maison s Ams Dis -eeo Da2 E Q) 


Ibi... bn llis <-s Ams Dis eba EQ 

iff Jcay, ...5 Oms (Cos -o (On E Q 
and : 
{<ay, tri s Om» c> RCR ERE > Am» (Cos era s (C) € Q} 


is recursive whenever Q is recursive. (Here we hover used ` our Sequence 
decoding function to compress a string of quantifiers into a single one.) a 


By part 4 of this theorem, R is recursively enumerable iff it is definable 
in N bya formula Jx, -+ 3x, p, where p is numeralwise determined by Az. 
In fact, we can require here that p be quantifier-free; this result was proved 
in 1961 (with exponentiation) and 1970 (without exponentiation). The proofs 
involve some number theory; we will omit them here. 

Notice that any recursive relation is also recursively enumerable., For 
if R is recursive, then it is defined in N by a formula 4x, e Ix, 9, where 
p is-numeralwise determined by Ag and xı, ...,%, do not occur in ¢. 


Theorem 35F A relation is recursive iff both it and its complement 
are recursively enumerable. 


This is the formal counterpart to the fact (cf. Theorem 17F) that a rela- 
tion is decidable iff both it and its complement can be effectively enumerated. 


Proof Jf a relation is recursive, then so is its complement, whence both 
are recursively enumerable. a 

Conversely, suppose that both P and its complement are recursive:y 
enumerable; thus for any .d, 


de P< Aba, by EQ, 
ae PL Aba ber 


` we check to see if <(m), (m),, (m)2> € R for m = 0, 1, 2, 
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for some recursive Q and R. Let 


f(@) = the least b such that either <d, b> € Q or (a bY € R. 


Such a number b always exists, and f is recursive. Finally, 


deP<=<@, fa) EQ, 


so P is recursive, 


The recursively enumerable relations constitute the formal counter- 
part of the effectively enumerable relations. For we have the following in- 


formal result, parallelling a characterization of recursive enumerability 
given by Theorem 35E, 


*Lemma 35G A relation is effectively enumerable iff it is the domain 
of a decidable relation. 


Proof Assume that Q is effectively enumerated by some procedure. 
Then e Q iff Ja[7 appears in the enumeration in n steps]. The relation 
defined in square brackets is decidable and has domain Q. 

Conversely, to enumerate {<a, b> : In <a, b, n> € R} for decidable R, 


. Whenever 
the answer is affirmative, we place <(m)9, (m),> on the output list. fa 


*Corollary 35H (Church’s thesis, second form) A relation is effectively 
enumerable iff it is recursively enumerable. 


Proof By identifying the class of decidable relations with the class of 
recursive relations, we automatically identify the domains of decidable 
relations with domains of recursive relations. 


The second form of Church’s thesis is, in fact, equivalent to the first 
form. To prove the first form from the second, we use Theorems 35F 
and 17F. 

We have already shown that a recursively axiomatizable theory is re- 
cursively enumerable, but using different words. We restate the result here, 
as it indicated the role recursive enumerability plays in logic. 


Theorem 351 If A is a set of sentences stich that #A is recursive, then 
#Cn A is recursively enumerable. 


Proof Item 20 of Section 3.4. 
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This theorem is the precise counterpart of the intuitive fact that a theory 
with a decidable set of axioms is effectively enumerable (Corollaries 25F 
and 26F). It indicates the gap between what is provable in an axiomatic 
theory and what is true in the intended structure. With a recursive set of 
axioms, all we can possibly obtain is a recursively enumerable set of conse- 
quences, But by Tarski’s theorem, Th N is not even definable in N, much 
less recursively enumerable. 

Even if we expand the language or add new axioms, the same phenom- 
enon is present. As long as we can recursively distinguish deductions from 
nondeductions, the set of theorems can be only recursively enumerable. 
For example, the set of sentences of number theory provable in your favorite 
system of axiomatic set theory is recursively enumerable. Furthermore, 
this set includes 4, and is consistent (unless you have very strange favorites). 
It follows that this set theory is nonrecursive and incomplete. (This topic 
is discussed more carefully in Section 3.6.) 


Weak representability 
Consider a recursively enumerable set Q, where 
ac Q< ib<a,bbeR 


for recursive R. We know there is a formula @ which represents R in Cn Ag. 
Consequently, the formula 3v,ọ defines Q in N. This formula cannot 
represent Q in Cn Ay unless Q is recursive. But it can come halfway. 


aceQ=<la,b) ER for some b 
= Ag } o(S%0, S°0) for some b 
=> Ag + Jv, o(S, vs). 

a% Q = <a, b> ¢ R for all b 
= Ay | ~e(S%, SO) for all b 


=> Ay Z Av, o(S%0, v). 


The last step is justified by the fact that if Ay H —=g(S°0) for all b, then 
An | Ax p(x). (The term w-consistency is given to this property.) For it 
is impossible for Ax p(x), —~p(S%), ~p(S'0), ... all to be true in %. 
Thus we have 
ae Q < An H Av, o(S%, va). 


It will be convenient to formulate a definition of this half of representability. 


| 
i 


AAALAC 
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Definition Let Q be an n-ary relation on N, y a formula in which only 


Vy, ... 7 i 
i s Va occur free. Then y weakly represents Q in a theory T iff for every. 
dis 34 in N, : . 


(Ay, 66,4) E Q <> y(S4,..., SO) ET. | 


Observe that if Q is representable in i | 
> a consistent theory T, t i 
weakly representable in T. EES 


Theorem 35J A relation is weak! R 
f y representabl I e o; 
cursively enumerable. p e in Cn Æp iff it is re- 


Proof We just showed that a recursively enumerable unary relation 
Q is weakly representable in Cn Ay; the same proof applies tO nary Q 


with only notational changes. Conversel 
; eS. y, let be weakly r 
by y in Cn Ag. Then ike erent 


(4, y an> E R<> AD[D is a deduction of y(S%0, ..., SO) 
from the axioms Ay] 


<> Idd, f(a, ..., a_)> € P 


for a certain recursive function f and recursive relation P E 


Arithmetical hierarchy 


Define a relation on the natural numbers to be arithmetical iff it is de- 
finable in N. But some arithmetical relations, are, in a sense, more defin- 
able than others. We can organize the arithmetical relations into a hierarch 
according to how definable the relations are. i 

Let A, be the class of recursive relations, and let X; be the class of re- 
cursively enumerable relations, Then we proceed to define the classes ZT 
and 2, by recursion on k. Let a relation belong to My iff its aln 
is in Zr . And let 24; be the class of domains of I, relations. (If we let 
w = Jo = Aj, then this will hold also for k = 0.) For example the first 
few classes consist of relations of the form shown in the Retort column: 


ay: {@: 4b Gd, by e R}, R recursive. 
If: {@: Vb <@, by € R}, R recursive. 
Ža: {@: de Vb <a, b,c € R}, R recursive. 
My: {a@: Ve qb <4, b, > € Ry; R recursive. 
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In general, a relation Q is in J, iff it is of then form 
{7 : Vb, Iba +» C bya, BY € R} 


for a recursive relation R. Here “O” is to be replaced by “Y” if k is odd 
and by “3” if k is even. Similarly, Q is in Zp iff it has the form 


{a : Ib, Wb «++ C bla, bY € R} 


for recursive R, where now “O” is replaced by “3” if k is odd and by 
“7” if k is even. 


EXAMPLE The set of Gödel numbers cof formulas numeralwise determi- 
ned by Ag is in H. 


Proof a belongs to this set. iff [a is the Gödel number of a formula o] 
and Wb Ad{d is Z of a deduction from Ay either of a(S, S20, ...) 
or of the negation of this sentence]. By the technique of Section 3.4, we 
can show that the phrases in square brackets define recursive relations. By 
using the English counterpart to prenex form, we obtain the desired form, 


{a : Vb Ada, b, d) E R}, 
with R recursive. E i 


Our earlier result (Theorem 35F) stating that a relation is recursive iff 
both it and its complement are recursively enumerable can now be stated 


by the equation 
á =% Noh. 


Since this equation holds, we proceed to define 4, for n > 1 by the analogous 
equation, 


A, = Zn O Tp. 


The following inclusions hold: 


M, IT, IT, 


The case A, € X, was mentioned previously (cf. Theorem 35F); its proof 


| 
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nonarithmetical sets 


arithmetical sets 


Figure 9. Picture of PN. 


hinged on the possibility of “vacuous quantification.” The proofs of the 
other cases are conceptually the same. If x does not occur in p, then 9, 
Vx, and x g are all equivalent. For example, a relation in 2, is defined 
by a formula Ay p, where g is numeralwise determined by Az. But the same 
relation is defined by Sy Vx p and Vx 3y p (where x does not occur in g). 
Hence the relation is also in X% and F. 

It is also true that all the inclusion shown are proper inclusions, i.e., 
equality does not hold. But we will not prove this fact here. The inclusions 
are shown pictorially in Fig. 9. 

The class of arithmetical relations equals Uz; 2, and also Ugly. For 
example, any relation in 2, is arithmetical, being defined in N by a formula 
Ax Wy y, where ¢ is numeralwise determined by Ag. Conversely, any arith- 
metical relation is defined in N by some prenex formula. The quantifier- 
free part of this prenex formula defines a recursive relation (since quantifier- 
free formulas are numeralwise determined by Ay). Consequently, the de- 
fined relation falls somewhere in the hierarchy. 

There are certain tricks which are useful in locating specific arithmetical 
relations in the hierarchy. For example, let A be the set of Gödel numbers 
of formulas a such that for some n, f 


Ag a(S") and (Vi < n) Ag H 7a(S'0). 


Then a E A iff ia is the Gödel of a wif a] and 3n{AD[D is a deduction from 


1 
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Ap of o(S"0)] and (Vi < n)\(AD,)[D; is a deduction from Ag of —a(S*‘0)]}. 
The parts in square brackets are recursive so we count the remaining quan- 
tifiers. The bounded quantifier “Vi < n” need not be counted. For we have 


(Vi < Nd) <d, i> € P <> (Ad\(Vi < n) ((a);,D © P. 


Use of this fact lets us push the bounded quantifier inward until it merges 
with the recursive part. Consequently, A € %4. 
The following theorem generalizes Theorem 351. 


Theorem 35K Let A be a set of sentences such that #A is in X,. Then 
#Cn A is also in diy. 


Proof Return to the proofs of items 18 and 20 of Section 3.4. We had 
there: 


aéitCn A <a is the Gödel number of a sentence and Jd[{d is a se- 
quence number and the last component of d is a and for 
every i less than lh d, either (1) (d); € #A. (2) (d); is the 
Gédel number of a logical axiom, or (3) for some j and 


k less than i, (d); = ACY * Dz * CAC>)) * (a)i * (A0))1- 


Since #A € 2, in (1) we must replace “(d); € tA” by something of the 
form 
1b, Wb, «++ O br <d DE O 


for recursive Q. It remains to convert the result into an English prenex 
expression in 2, form. We suggest that the reader set k = 2 and write 
out this expression; the device used in the preceding example will help. E 


EXERCISES 


1. Show that there is no recursive set R such that #Cn Ay © R and 
Ho : (0) € Cn Ap} S R. (This result can be stated: The theorems of 
Ay cannot be recursively separated from the refutable sentences.) 


2. Let A be a recursive set of sentences in a recursively numbered lan- 
guage with 0 and S. Assume that every recursive set is representable in the 
theory Cn A. Further assume that A is w-consistent; i.e., there is no for- 
mula y such that A } dx g(x) and for all ae N, A | —=q(S%), Con- 
struct a sentence o indirectly asserting that it is not a theorem of A, and 
show that neither A j- o nor Abo. l 


ANDUN 


ai G 


EN 
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3. Let T be a theory in a recursively numbered language (with 0 and $). 
Assume that all recursive subsets of X are weakly representable in T. Show 
that #7 is not recursive. Suggestion: Construct a binary relation P such — 
that any weakly representable subset of N equals {b : <a, b> € P} for some 
a, and such that P is recursive if #7 is. Consider {b : <b, b> ¢ P}. 


4, Show that there exist 28° nonisomorphic countable models of Th K. 
Suggestion: For each set A of primes, make a model having an element 
divisible by exactly the primes in A. 


5. (Lindenbaum) Let T be a decidable consistent theory (in a reasonable 
language). Show that T can be extended to a complete decidable consistent 
theory T”. 


6. Consider the language of equality, having V as its only parameter. 
Let A,, be the translation of “There are at least n things,” cf. the proof of 
Theorem 26B. Call a formula simple iff it can be built up from atomic 
formulas and the A,’s by use of connective symbols (but no quantifiers). 
Show how, given any formula in the language of equality, we can find a 
logically equivalent simple formula. Suggestion: View this as an elimina- 
tion-of-quantifers result (where the quantifers in A, do not count). Use Theo- 
rem 31F. 


7. (a) Assume that A and B are subsets of N” belonging to Xy (or Ma). 
Show that A U B and A N B also belong to 2; (or Iy, respectively). 

(b) Assume that A is in Xp (or M) and that the functions f}, ...5/n 
are recursive. Show that 


{a:<fi(@),... 


is also in Xy (or My, respectively). 


Sala) € A} 


8. Let T be a theory in a recursively numbered language (with 0 and S). 
Let n be fixed, n > 0. Assume that all subsets of N in Z, are weakly rep- 
resentable in 7. Show that #7 is not in /7,. (Observe that Exercise 3 is 
a special case of this, wherein n = 0. The suggestions given there carry 
over to the present case.) 


§3.6 APPLICATIONS TO SET THEORY 


We know that in the language of number theory, Cn Ay is incomplete 
and nonrecursive, as is any compatible recursively axiomatizable theory 
in the language. 
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But now suppose we leave arithmetic for a while and look at set theory, 
Here we have a language (with the parameters V and €) and a set of axioms, 
In all presently accepted cases the set. of axioms is recursive. Or more 
precisely, the set of Gödel numbers of the axioms is recursive. And so the 
theory (set theory) obtained is recursively enumerable. We claim that this 
theory, if consistent, is not recursive and hence not complete. We will 
presently prove this, but we can already sketch the argument in rough form. 
We can, in a very real. sense, embed the language of number theory in set 
theory. We can then look at that fragment of set theory which deals with 
the natural numbers and their arithmetic (the shaded area in Fig. 10). 
That is a theory compatible with Ay. And so it is nonrecursive. Now 
if set theory were recursive, then its arithmetical part would also be re- 


Sentences in the 
language of num- 
ber theory 
(recursive) 


Set theory 
(recursively 


N enumerable) 


(a) 


Sentences in the language of 
number theory (recursive) 


Image of the above 
set under an 
. interpretation 


” Set theory 
(recursively p 
' enumerable) ~~ 


(b) 


Figure 10. Set theory and number theory. (a) Flat picture. (b) A more accurate picture: 
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cursive, which it isn’t. As a bonus, we will come across a remarkable theorem 


; regarding the diffculty of proving set theory to be consistent. 


Henceforth by set theory (ST) we mean that theory (in the language. 
with equality having the two parameters Y and €) which is the set of con- 
sequences of the reader’s favorite set-theoretic axioms. (The standard 
Zermelo-Fraenkel axioms will do nicely, if the reader has no favorite. We 
ask only that the set of axioms be recursive, and that it be strong enough 
to yield certain everyday facts about sets.) We need an interpretation x 
of Cn Ay into ST. (For the remainder of this section we assume a familiarity 


o with Section 2.7.) But the existence of such a x is a standard result of set 
- theory, although it is not usually stated in these words. We need formulas 


of the language of ST which adequately express the concept of being a 

natural number, being the sum of two given numbers, and so forth. 
The formula my is the result of eliminating the defined symbol w from the 

formula vı € œ. The formula sx is similarly obtained from the set-theoretic 


formula v, ~ Ø, and the formula sg is obtained from v, = v; U {vı}. The 


formula z< is simply vı E€ va. For m4, we use the translation into the lan- 
guage of ST of 


For any f, if f : œ X o —> w and for all a and b in w we have f(a, B) =a 
and f(a, b U {b} = f(a, b) VLG b)}, then f(v1, va) = 9s. 


(The manner of translation is partially indicated in Chapter 0.) The for- 
mulas x, and xp are obtained in much the same fashion. 

The claim that this a is an interpretation of Cn Ay into ST makes a num- 
ber (and the number is seventeen) of demands on ST. 


(i) Iw, æy must be in ST. It is, since we can prove in set theory that 
is nonempty. 

(ii) For each of the five function symbols f in the language of Ay, ST 
must contain a sentence asserting, roughly, that sep defines a function on the 
set defined by zy. (The exact sentence is set forth in the definition of inter- 
pretation in Section 2.7.) In the case of 0, we have in ST the result that 
there is a unique empty set and that it belongs to w. The case for S is simple, 
since zg defines a unary operation on the universe of all sets, and œw is 
closed under this operation. For + we must use the recursion theorem on w, 
That is, we can prove in ST (as sketched in Section 1.2) that there is a unique 
fw X wo such that f(a, Ø) = 4 and f(a, b U {b} = flab) U {f(@, 5)} 
for a, b in w. The required property of x, then follows. Similar arguments 
apply to - and E. 

(iii) For each of the eleven sentences o in Az, the sentence o” must be 
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in ST. For example, in the case of L3, we have in ST the fact that for any 
m and n in œ, either m € n, m =n, O NEM.. 


Since these demands are finite in number, there is a finite Ø € ST such 
that x is also an interpretation of Cn Ag into Cn ®. 


Theorem 36A (strong undecidability of set theory) Let T be a theory 
in the language of set theory such that T U ST (or at least T U ®) is con- 
sistent. Then #7 is not recursive. 


Proof Let A be the consistent theory Cn(T U ®). Let 4, be the cor- 


responding theory [A] in the language of number theory. From Section 
2.7 we know that A, is a consistent theory (since A is). Also Ag S 4o, 
since if o € Ag, then o” € Cn Ø c A. Hence by the strong undecidability 
of Cn Ag (Theorem 35C), 4o is not recursive. 

Now we inust derive the nonrecursiveness of T from that of 4,. We have 


coed, iff o EA 
and by the lemma below, #0” depends recursively on #0. Hence #4 cannot 
be recursive, lest #4, be. Similarly, we have 


tred iff (p> +) € T, 


where is the conjunction of the members of ©. Since #(p —> t) depends 
recursively on r, we have that #7 cannot be recursive lest #4 be. 


Lemma 36B There is a recursive function p such that for any formula 
a of the language of number theory, pia) = #(a”). 


Proof ïn Section 2.7 we gave explicit instructions for constructing a”. 
The construction in some cases utilized formulas 2” for formulas p simpler 
than a. The methods of the Section 3.3 and 3.4 can be applied to the Gö- 

‘del numbers of these formulas to show that p is recursive. But the details 
are not particularly attractive, and we omit them here. m 


Corollary 36C If set theory is consistent, then it is not complete. 


Proof Set theory has a recursive set of axioms. If complete, the theory 
is then recursive (by item 21 of Section 3.4). By the foregoing theorem, 
this cannot happen if ST is consistent. 


Corollary 36D In the language with equality and a two-place predicate 
symbol, the set of (Gödel numbers of) valid sentences is not recursive. 
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Partial proof In the foregoing theorem take T = Cn Ø, the set of valid 
sentences. The theorem then assures us that #7 is nonrecursive, provided 
that ® is consistent. We have not given the finite set ® explicitly. But we 
assure the reader that Ø can be chosen in such a way as to be provably 
consistent. a 


It should be noted that x is not an interpretation of Th N into ST (unless 
ST is inconsistent). For 2“ [ST] is a recursively enumerable theory in the 
language of Ñ, as a consequence of Lemma 36B. Hence it cannot coincide 
with Th 9, and it can include the complete theory Th ® only if it is incon- 
sistent. 


Gwdel’s second incompleteness theorem 


We can employ our usual tricks to find a sentence o of number theory 
which indirectly asserts that its own interpretation o” is not a theorem of 
set theory. For Jet D be the ternary relation on N such that 


a, b, c> € D iff a is the Gödel number of a formula æ of number 
theory and c is the Gödel number of a deduction 
from the axioms of ST of a(S?0)*. 


The relation D is recursive (by the usual arguments); let (v; , va, V3) rep- 
resent D in Cn Ag. Let r be the Gödel number of 


Yva ÖC, V1, Ys) 


and let ø be 
Vv, 76(S0, S'O, v5). 


Observe that o does indirectly assert that o” ¢ ST. We will now prove that 
the assertion is correct: 


Lemma 36K If ST is consistent, then o” ¢ ST. 


Proof Suppose to the contrary that o” is deducible from the axioms of 
ST; let k be Z of such a deduction. Then <r, r, kò € D. 


. Ap | Ô(S"0, S0, 5*0); 
. Ag | Avs ô(S70, 86, v3); 


Ag} oo. 
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Applying our interpretation z, we conclude that ~o” is in ST, whence ST 
is inconsistent. Thus 


ST is consistent = o” ¢ ST. 


Now the above proof, like all those in this book, is carried out in intuitive 
mathematics. But all of our work in the book could have been carried out 
within ST. Indeed it is common knowledge that essentially all work in 
mathematics can be carried out in ST. Imagine actually doing so. Then 
instead of a proof of an English sentence, “ST is consistent => o” ¢ ST,” 
we have a deduction from the axioms of ST of certain senténce in ve 
formal language of set theory: 


(Cons(ST) => (7}). 


Here Cons(ST) is the result of translating (in a nice way) “ST is con- 


sistent” into the language of set theory. Similarly, [7] is the result of trans- . 


lating “o” ¢ ST.” But we already have a sentence in the language of set 
theory asserting that o” ¢ ST. It is o”. This strongly suggests that [T] is 
(or is provably equivalent in ST to) o7, from which we get: 


(Cons(ST) —> 0o”) 


as a theorem of ST. 

Now this can actually be carried out in such a way as to have [7] be 0”. 
We have given above an argument which we bope will convince the reader 
that this is at least probable. And from it we now have 


Gödels Second Incompleteness | Thesrem The sentence Cons(ST) is 
not a theorem of ST, unless ST-is inconsistent. 


Proof By the above (plausibility) argument 
(Cons(ST) => 0”) 


is a theorem of ST. So if Cons(ST) is also a theorem of ST, then g" is too. 
But by Lemma 36E, if o” e ST, then ST is inconsistent. z 


Of course if ST is inconsistent, then every sentence is a theorem, includ- 
ing Cons(ST). Because of this, a proof of Cons(ST) within ST would not 
convince anyone that ST was consistent. (And by Gédel’s second theorem, 
it would convince him of the opposite.) But prior to Gédel’s work it was 
possible to hope that Cons(ST) might be provable from assumptions weaker 
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than the axioms of set theory, ideally assumptions already. known to be 


consistent. But we now see that Cons(ST) is not in any subtheory of ST, 


unless of course ST is inconsistent. 


EXERCISE 


Let T be a theory in a recursively numbered language, and assume that 
there is an interpretation of Cn Ag into T. Show that T is strongly unde- 
cidable; i.e., whenever T” is a theory in the language for which T U T” is 
consistent, then #7" is not recursive. 


§3.7 REPRESENTING EXPONENTIATION! 


In Sections 3.1 and 3.2 we studied the theory of certain reducts of N and 
found them to be decidable. Then in Section 3.3 we added both multiplica- 
tion and exponentiation. The resulting theory was found (in Section 3.5) 


` to, be undecidable. Actually i it would have been enough to add only multi- - 


plication (and forego exponentiation); we would still have undecidability. 
Let Ny be the reduct of N obtained by dropping exponentiation: 


RNa = (N, 0, $, <, +, +). 


Thus the symbol Æ does not appear in the language of Nar. Let Am be the 


set obtained from Ag by dropping El and E2. The purpose of this section 


‘is to show that all the theorems of Sections 3.3-3.5 continue to hold when 
“Ay? and “R are replaced by “Am” and * ‘i’. The key fact needed to 


establish this claim is that exponentiation is representable in Cn 4m. That 
is, there is a formula € in the language. of Ny, such that for anv a and b, 


Am + V2le(S%0, SO, z) <> z = SO). 


Thus s can be used to simulate the formula x Ey {= z without actual 
use of the symbol E. . 

If we look to see what relations and functions are representable in Cn Am, 
we find at first that everything (except for exponentiation itself) that was 
shown to be representable in Cn Ag is (by the same proof) representable in 
Cn Am. Until, that is, we reach item 7 in the catalog listing of Section 3.3. 
To go further, we must show that exponentiation itself is representable i in 


Cn Au: 


1This section may be omitted without loss of continuity. 
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We know that exponentiation can be characterized by the equations 


@ = 1, 


aot! =a a, 


From what we know about primitive recursion (catalog item 13 in Section 
3.3 plus Exercise 8 there), we might think of defining 


E*(a, b) = the least s such that [(s) = 1 and for 
all i < b, (Jn = (s) + al. 


For then a? = (E*(a, b)),. This fails to yield a proof of representability, 
because we do not yet know that the decomposition function (a), is repre- 
sentable in Cn Ay. But we do not really need that particular decomposition 


function (which corresponded to a particular way of encoding sequences), ` 


All we need is any function 6 which acts like a decomposition function; 
the properties we need are summarized in the following lemma. 


Lemma 37A There is a function ô representable in Cn Ay, such that 
for every n, do, ...,@,, there is an s for which ô(s, i) = a; for all i < n. 


Once the lemma has been established, we can define 


E**(a, b) = the least s such that [6(s,0) = 1 and for 
all i < b, d(s,i + 1) = d(s, i) - al. 


The lemma assures us that such ans exists. E** is then representable in 
Cn Aym, as is exponentiation, since 


aè = 6(E**(a, b), b). 


A function ô which establishes the lemma will be provided by some facts 
of number theory. 


A pairing function 


As a first step toward proving the foregoing lemma, we will construct 
functions for encoding and decoding pairs of numbers. It is well known that 
there exist functions mapping N x N one-to-one onto N. In particular, 
the function J does this, where in the diagram shown, J(a, b) has been 
written at the point with coordinates <a, bY: 
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For example, J(2, 1) = 8 and J(0, 2) = 3. To obtain an equation for 
Ja, b), we note that along the line x + y = n there are n + 1 points (with 
coordinates in N). Thus 


Ha, b) = the number of points in the plane to which J assigns smaller values 
= [the number of points on lines x + y =n for n= 05 Lees! 
(a + b —1)] + [the number of points on the line x + y = a + b 
for which x < a] 
=[1+2+---+@+)]+4 
=}@+b)a+b+)+a 
= ġie + by + 3a + b]. 


Let K and L be the corresponding projection functions onto the axes, 
i.e.,4the unique functions such that 


K(J(a, b)) = a, L(J(a, b) = b. 


For example, K(7) = 1, the x-coordinate of the point <1, 2> in the plane 

to which J assigned the number 7. Similarly, L(7) = 2, the y-coordinate 

of that point. l 
We claim that J, K, and L are representable in Cn Ay. The function- 


H(a) = the least b such that a < 2b 


has the property that H(a) = 4a for even a, Then we can write 


Ia, b) = H(a +b) (a+b +1) +a 
K(p) = the least a such that [for some b < p, J(a, b) = pl, 
L(p) = the least b such that [for some a < p, Xa, b) = p). 
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From the form of the four preceding equations we conclude that H, J 
K, and L are representable in Cn Ay. cues 


The Gödel -function 
Let B be the function defined as follows: 


P(c, d, i) = the remainder in c + [1 + (i + 1)- d] 
= the least r such that for some q < c, c = q+ [1 + (i+ 1)-d]+r 


This unlikely-looking function i 
i produces a satisfactory d iti 
function for Lemma 37A. Let eee ter 


is, i) = B(K(S), L(s), i). 


It is clear that ô is representable in Cn Aw. What is not so obvious is that 
it meets the conditions of Lemma 37A. We want to show: 


è 
(*) For any n and any a, ...,a,, there are numbers 
c and d such that £(c, d, i) = a; for all i <n. 


For then it follows that ô(J(c, d), i) = Be, d, i) = a; fori <n 

l Now (+) is a statement of number theory, not logic. The pivot of (+) 
is based on the Chinese remainder theorem. Numbers d), ..., d, are said 
to be relatively prime in pairs iff no prime divides both d; and d, tor i 2 $ 


Chinese Remainder Theorem Let d,....,d, be relatively prime in 


pairs; let a), ...,@, be natural numbers with each a; < d;. Then we can 
find a number c such that for all i < n, 


. a; = the remainder in c - d,. 
Proof Let p =IT;<,d;, and for any c let F(c) be the (n + 1)-tuple of 
remainders when c is divided by dọ, 
sible values for this (n + 1)-tuple. 
é We claim that F is one-to-one on {k :0 <k <p}. For suppose that 
(e) = Fie). Then each d; divides | c, — c, |. Since the d,’s are relatively 
pay p must divide | c, — c, |. For cy, c, less than p, this implies that 
1 = Cy, 
Hence the restriction of F to {k:0<k <p} takes on all p possible 
values. In particular, it assumes (at some point c) the value <ay, ..., Gn 
And that is the c we want. C] l 5 


..+,@,. Notice that there are p pos- 
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Lemma 37B For any s > 0, the s+ 1 numbers 
i+1-st, 1+2-s!,...,1+@4+1))-s! 
are relatively prime in pairs. 


Proof All these numbers have the property that any prime factor q 
cannot divide s!, whence q > s. If the prime q divides both 1 + j » stand . 
14+ k- sl, then it divides their difference, | j — k | + s!. Since q does not 
divide s!, it divides | j — k |. But | j— k| Ss <4. This is possible only 
if |j—k| =0. a 


Proof of (+) page 248 Assume we are given do, «-. » An; We need numbers 
cand d such that the remainder when c is divided by 1 + @+ \)- dis 
a;, for ion. 

Let s be the largest of {n, do, ...,4,} and let d= s!. Then by Lemma 
37B, the numbers 1 + (i + 1) « d are relatively prime in pairs for i <n. 
So by the Chinese remainder theorem there is a c such that the remainder 
ine > {1-4 (i+ 1)» d] is a; for isa. ' 


This completes the proof of Lemma 37A. And by the argument which- 
followed that lemma, we can conclude: 


Theorem 37C Exponentiation is representable in Cn Ay. 


Armed with this theorem, we can now return to catalog item 7 of Section 
3.3. The proof given there now establishes that the function in question 
(whose value at n is Pa) is representable in Cn 4y. For it was formed by 
allowable methods from relations and functions (including exponentiation) 
known to be representable in Cn Am. 

The same phenomenon persists throughout Sections 3.3 and 3.4. The 
representability proofs given there now establish representability in Cn Am. 
Thus any recursive relation is representable in Cn Am, and if the relation 
happens to be a function, then it is functionally representable. The proofs 
given in Section 3.5 then apply to Rar and Am as well as to N and Ag. 
In particular, we have the strong undecidability of Cn Ay: Any theory T 
in the language of Ny, for which T U Am is consistent cannot be recursive. 

Notice that any relation definable in % (i.e., any arithmetical relation) 
is also definable in Ny. For exponentiation, being representable in a sub- 
theory of Th Nys, is a fortiori definable in Nar. By the new version of Tarski’s 
theorem, #Th Ny is not definable in Nay, and consequently #Th Ny, cannot 
be arithmetical. 
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FAN 2 l ' A) | 
ae a 3 a pE: In the terminology of Section 2.7, we can.say that there is a faithful 
a S E “ae =~ + = i interpretation of Th N into Th Nar. It equals the identity interpretation on 
p 2 -p T E $ zA R all parameters except E, and to E it assigns a formula defining exponentia- 
: F aa SS PEM tion in Ry. l 
3 E = aa Bee Z A In Table x we summarize some of the results of Chapter 3 on number 
= cf ae 2 8 zE Š theory and its reducts. 
oe a 2 
n EXERCISES 
n § g § ; | 
B 2 z f $2 g 2 i E l 1. Let D(a, b) = (ay. Show that any arithmetical relation is definable 
2 E eg g Fi E g g 3 3 5 in the structure (N, <, D). 
O- ra ` : 
a z = d F g 3 z hi ah: E F 2. Show that the addition relation {<a, b,c) :a + b = c} is definable 
A Ya 27 S a 5 f E 3 oS in the structure (N, S, +). Suggestion: Under what conditions does the 
as gt at oe? B Sp equation S(ac) « S(bc) = S(c + c » S(ab)) hold? 
So ik V fy + mM. = 
Pe 3, (a) Show that Th(Z, +, +) is strongly undecidable. (See the exercise 
D $ D as E i By g 4d of Section 3.6.) 
X 3 s # g 5 5 £ g z £ d 2a (b) (This part assumes a background in algebra.) Show that the Hen: 
v as 5 BA K R YË a ‘a g B of rings is undecidable and that the theory of commutative rings is unde- 
: a ge os P ag i B. E cidable. 
ajej a ep Bede gF etg 
3 a 4 3 E $ 5 i: 8 f E E ; gi 8 §3.8 RECURSIVE FUNCTIONS 
3] È pa ge 8 oS RZE SRE | 
4 2 < Š d < We have used recursive functions (i.e., functions which, when viewed as 
DB eo eed relations, are recursive) to obtain theorems of incompleteness and un- 
ge a | ee. g decidability of theories. But the class of recursive functions is also an in- 
E < & 2 £ 2 E teresting class in its own right, and in this section we will indicate a few 
È Š g5 4 8 3 J 3 of its properties. ele 
rs E £ , mg E g & 9 £ Recall that by Church’s thesis, a function is recursive iff it is computable 
sas. $ g 34 a aq g EE g by an effective procedure (page 201). This fact is responsible for much of 
p R E & a 8 3 g E g S g © A the interest in recursive functions. At the same time, this fact makes pos- 
a < Ë j < a 8 5 R & sible an intuitive understanding of recursiveness, which greatly facilitates 
` the study of the subject. Suppose, for example, that you are suddenly asked 
whether or not the inverse of a recursive permutation of N is recursive. 
x R Before trying to prove this, you should first ask yourself the intuitive coun- 
É = p + terpart: Is the inverse of a computable permutation f also computable? 
È V V Vv You then (hopefully) perceive that the answer is affirmative. To compute 
n Ps ao a (2 f-G), you can compute f(0), /(1), ... until for some k it is found that 
ee S S S f(k) = 3. Then f—1(3) = k. Having done this, you have gained two advan- 
g z & z S z tages. For one, you feel certain that the answer to the question regarding 
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recursive permutations must also be affirmative. And second, you have a 
good outline of how.to prove this; the proof is found by making rigorous 
the intuitive proof. This strategy for approaching problems involving re- 
cursiveness will be very useful in this section. 

Before proceeding, it might be wise to summarize here some of the facts 
about recursive functions we have already established. We know that a 
function f is recursive iff it (as a relation) is representable in Cn Ag, by 
Theorem 34A. Consequently, every recursive function is weakly represen- 
table in this theory. 

In Section 3.3 a repertoire of recursive functions was developed. In ad- 
dition, it was shown that the class of recursive functions is closed under 
certain operations, such as composition (Theorem 33L) and the “least- 
zero” operator (Theorem 33M). 

We also know of a few functions which are not recursive. There are 
Wo functions from N” into N altogether, but only $, of them can be 
recursive. So an abundance of nonrecursive functions exists, despite the 
fact that the most commonly met functions (like the polynomials) were 
shown in Section 3.3 to be recursive. By catalog item 1 of Section 3.3, 
the characteristic function of a nonrecursive set is nonrecursive. For exam- 
ple, if f(a) = 1 whenever a is the Gödel number of a member of Cn Ag 
and f(a) = 0 otherwise, then f is not recursive. 


Normal form 


For any computable function, such as the polynomial function a” + 3a + 
5, one can in principle design a digital computer into which one feeds a 
and out of which comes a? + 3a + 5 (Fig. 11). But if you then want a 
different function, you must build a different computer. (Or change the 
wiring in the one you have.) It was recognized long ago that it is usually 
more desirable to build a single general-purpose stored-program computer. 
Into this you feed both a and the program for computing your polynomial 


a? + 3a+5 


Figure 11. Special-purpose computer. 
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> F 


Figure 12. General-purpose computer. 


program 


a? +3a+5 


(Fig. 12). This “universal” computer requires two inputs, and it will com- 
pute any one-place computable function (if supplied with enough memory 
space), provided that the right program is fed into it. Of course, there are 
some programs which do not produce any function on N, as many a pro- 
grammer has, to his sorrow, discovered. 

In this subsection and the next, we will repeat what has just been said, 
but with recursive functions and with proofs. For our universal computer 
we will have a recursive relation T; and a recursive function U. Then for 
any recursive f : N — N there will exist an e (analogous to the program) 
such that i 

f(a) = U (the least k such that <e, a, k> € Ty) 


= U(uk <e, a, k) € Ty); 


where the second equation is to be understood as being an abbreviation 
for the first. Actually e will here be the Gédel number of a formula 
which represents (or at least weakly represents) f in Cn Ay. And the numbers 
k for which <e, a, k> € T, will encode both f (a) and Z of a deduction from 


Ay of 9(S%0, S10). 


Definition For each positive integer m, let Tp be the (m + 2)-ary re- 
lation to which an (m + 2)-tuple <e, a, -+ am» ky belongs iff 
(i) e is the Gödel number of-a formula p in which only 11, . s Yms 


Vm occur free; : l 
(ii) k is a sequence number of length 2, and (k) is Y of a deduction from 


Ag of 9(S%0, ..., 890 , S0). 


The idea here is that for any one-place recursive function f we can first 
of all take e to be the.Gédel number of a formula p weakly representing f 
(as a relation). Then we know that for any a and b, 


An (80, 8%) if b =f). 
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So any number k meeting clause (ii) of the definition must equal <(k),, 
f(@>, where (k) is F of a deduction of y(S0, S10) from Ag. (We have 
departed from the usual definition of T, here by not requiring that k be 
as small as possible.) 


Take for the “upshot” function U the function 
Uk) = (k). 


This U is recursive and in the situation described in the preceding paragraph 
we have U(k) = f(a). 


Lemma 38A For each m, the relation Tp is recursive. 


Proof, for m=2 <le, a,,0,,k> € T, iff e is the Gödel number of a 
formula, (Vv, Vv, Yv) * e is the Gödel number of a sentence, k is a se- 
quence number of length 2, (k) is Z of a deduction from Ag of ` 


Sb(Sb(Sb¢e, Hv: , g(@y)), Hea, 8(42)), Hes, 8((A)1)), 


where g(n) = #50. From Section 3.4 we know all this to be recursive. 


Theorem 38B (a) For any recursive function f: N™— N, there is an 


e such that for all a,,..., ams 


I, eara s Ay) E U(uk<e, ais» ema Ams KÒ E€ Ta). 


(In particular, such a number k exists.) 

(b) Conversely, for any e such that Va, --+ dm Ikle, a, .. 
the function whose value at a, .. 
is recursive. |... 


69 Amy kh ED ny, 
2 an is U(uk<e, Alg rey Bins kò E€ Tn) 


Proof Part (b) follows immediately the fact that U and Tp are recursive. 
As for part (a), we take for e the Gödel number of a formula y weakly 
representing fin Cn Ay. Given any d, we know that A, | g(S%0, ..., S70, 
S10), If we let d be Y of a deduction from Ay of this sentence, then 
<e, d, <d, f(@)>> € Tn. Hence there is some k for which <e, &, kY € Tm- 
And for any such k, we know that Ag | p(S%0, ..., Sëm0, S%0), since 
(k) is F of a deduction. Consequently, U(k) = (k) = f(@) by our choice 
of p. Thus we have U(uk<e, d, kY € Ta) = f(@). 


This theorem, due to Kleene in 1936, shows that every recursive function 
is representable in the normal form 


` f(@) = Ulukée, È, ky € Tp). 
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Thus a computing machine able to calculate U and the characteristic 
function of T, is a “universal” computer for one-place recursive functions. 
The input e corresponds to the program, and it must be chosen with care 
if any output is to result (i.e., if there is to be any k such that <e, a, kò € T)). 


Recursive partial functions 


The theory of recursive functions becomes more natural if we consider 
the broader context of partial functions. 


Definition A m-place partial function is a function f with dom f & N” 
and ranf S N. If @¢ dom f, then {(@) is said to be undefined. If dom f 
= N”, then f is said to be zotal. 


The reader is hereby cautioned against reading too much into our choice 
of the words “partial” and “total” (or the word “undefined,” for that mee 
ter). A partial function f may or may not be total; the words “partial 
and “total” are not antonyms. l RDE E: 

We will begin by looking at those partial functions which are intuitively 
computable. 


*Definition An m-place partial function f is computable iff there is an 
effective precedure such that (a) given an m-tuple din dom f, the procedure 
produces f(a); and (b) given an m-tuple @ not in dom f, the procedure pro- 
duces no output at all. 


This definition extends the one previously given for total functions. 
At that time we proved a result (Theorem 33H), part of which generalizes 
to partial functions. 


*Theorem 38C An m-place partial function f is computable iff f (as 
an (m + 1)-ary relation) is effectively enumerable. 


Proof The proof is reminiscent of the proof.of another result, Theorem 
17E. First suppose we have a way of effectively enumerating f. Given an 
m-tuple @, we examine the listing of the relation as the procedure churns 
it out. If and when an (m + 1)-tuple beginning with @ appears, we print 
out its last component as f(@). 


Conversely, assume that f is computable, and first suppose that fis a 
one-place partial function. We can enumerate fas a relation by the follow- 
ing procedure: 
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1. Spend one minute calculating /(0). 

2. Spend two minutes calculating /(0), then two minutes calculating fO). 

3.. Spend three minutes calculating. /(0), three minutes calculating /(1), 
and three minutes calculating f (2). 


And so forth. Of course, whenever one of these calculations produces 
any output, we place the corresponding pair on the list of members of the 
relation f. 

For a computable m-place partial function, instead of calculating the 
value of f at 0, 1, 2, ... we calculate its value at <(0),, ..., On- 
Los s Dm- (Qos s+ +s (2)m—1>» etc. E 


In the case of a computable total function J, we were also able to con- 
clude that f was a decidable relation. But this may fail for a nontotal f. 
For example, let 2 i 

_ 0 if a c ¥Cn Ag, 
fa = ENET otherwise. 


Then f is computable., (We compute f(a) by enumerating #tCn Ag and 
looking for a.) But f is not a decidable relation, lest #Cn Ap be decidable. 
On the basis of this example and the foregoing theorem, we select our 


definition for the precise counterpart of the concept of computable partial 
function, 


Definition A recursive partial function is a partial function which, as 
a relation, is recursively enumerable. . 


The reader should be warned that “recursive partial function” is an indi- 
visible phrase; a recursive partial function need not be (as relation) recursive. 
But at least for a total function our terminology is consistent with past 
practice. 


Theorem 38D Let f: N™— N be a total function. Then f is a recursive 
partial function iff f is recursive (as a relation), 


Proof If fis recursive (as a relation), then a fortiori f is recursively enu- 
merable, Conversely, suppose that f is recursively enumerable, Since S 
is total, 


J(@) Æ b < Jeffa) = c&bA ch. 


The form of the right-hand side shows that the complement of f is also 
recúrsively enumerable, Thus by Theorem 35F, J is recursive. 


(Fig. 13). For any computa 
which computes it. But now t 
some computable partial functio 
the empty function, but that is a compu 


3.8 Recursive Functions 257 


i ictured a two-input device 

iscussing normal form results, we P l i 
ee : ble partial function, there is some program 
he converse holds: Any program will produce 
n. Of course many programs will produce 


table partial function. 


se 
rc 


Figure 13, Computer with program for f. 


program 


(a) 


ee De- 
For the recursive partial functions the same considerations apply. D 


fine, for each e € N, the m-place partial function felm by 


felme prs Am) i U(uk <e, Ajs eeo Ams kò E€ Tn) 


The right-hand side is to be understoo 
In other words, 


d as undefined if there is no such k. 


de dom felm iff Ake, ay, -> Am K E Tmo 


q) is gi 5 tion. 
in which case the value fe],(@) is given by the above equa 


-+ 1)-place partial 


: : a) The (m 
Normal Form Theorem (Kleene, 1943) (a) an) is a recursive 


function whose value at <e, d1, -+> An> is [elmi +++ 
partial function. 
(b) For each e = 9, lelne 
(c) Any m-place recursive par 


is a m-place recursive partial function. 
tial function equals [e],, for some e. 


s ER ` = b & 
Proof (a) We have fel, (2) = b <> Ikie, 4, Kye T & ee 7 
(Wki < k) <e, d, k'> € T.,) The part in square brackets is rec ; 
the function (as a relation) is recursively a a 
(b) The above proof still applies, e now being he $ o saa 
) Let f be an m-place recursive partial function, so that (<a, ; $ 
G is recursively enumerable. Hence there is a a ; a be 
n l in = For i = b, 
is relation in Cn Ag. We claim that f= Pln- = 
ava Jou : Sand %0). Hence there is a k such that dy, 4, k> 
en Ag pr peers , 
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ET yp. For any such k, U(k) = b, since Ay / 9(S%0, ...,S%0, SO) for 
cÆ b. Similarly, if f (Z) is undefined, then Ag / o(S“0, ..., S%0, SO) for 
any c, whence [f#p]n is undefined here also, a 


The function fe], is said to be the m-place recursive partial function with 
index e. Part (c) of the normal form theorem tells us that every recursive 
partial function has an index. The proof shows that the Gödel number of 
a formula weakly representing a function is always an index of the 
function, 

We now have a convenient indexing [Olh , [1], ... of the one-place re- 
cursive partial functions. Function [e], is produced by the “instructions” 
encoded by e. Of course, that function will be empty unless e is the Gödel 
number of a formula and certain other conditions are met. 

All the recursive total functions are included in our enumeration of 
recursive partial functions. But we cannot tell effectively by looking at a 
number e whether or not it is the index of a total function: 


Theorem 38E {e : fe], is total} is not recursive, 
Proof Call this set A. Consider the function defined by 


f(@ = te +1 


if ae A, 
if a ¢ A. 


Then J, by its construction, is total. Is it recursive? We have 


{@=be|a@¢A&b=0) or (ac A& Akka, a, k) ET, 
& b = Uk) + 1 &(Vj < kXa, a, j> ¢ TDI. 


Thus if A is recursive, then f (as a relation) is recursively enumerable. But 
then f is a total recursive function, and so equals [e], for some e € A. 


But f(e) = fe],(e) + 1, so we cannot have f= [e],. This contradiction 
shows that A cannot be recursive. a 


l It is not hard to show that A is in J7,. This classification is the best pos- 
sible, as it can be shown that A is not in 2. 


Theorem 38F The set 
K = {a : [a],@) is defined} 


is recursively enumerable but hot recursive. 
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Proof K is recursively enumerable, since a € K <> Jk la, a, ky € Ti. 
To see that K cannot be recursive, consider the function defined by 


ae { Ne +1 if ae K, 


if ad K. 


This is a total function. Exactly as in the preceding theorem, we have that 
K cannot be recursive. [E 


Corollary 38G (unsolvability of the halting problem) The set 
{<a, b> : bhe) is defined} 
is not recursive. 


This corollary tells us that there is no effective way to tell, given a pro- 
gram b for a recursive partial function and an input a, whether or not 


the function [b], is defined at a. 
We can obtain an indexing of the recursively enumerable relations by 


‘using the following characterization. 


; : f fs 
Theorem 38H A relation on N is recursively enumerable iff it is the 
domain of some recursive partial function. 


Proof The domain of any recursively enumerable relation is also recur- 
sively enumerable; cf. part 4 of Theorem 35E. In particular, the domain 


of any recursive partial function is recursively enumerable. 
Conversely, let Q be any recursively enumerable relation, where 


de o< Iba, b) ER 


with R recursive. Let 
| f(a) = ub a, by € R; 
ie, 
f@ = b < Ga, bye R&(Ve <b) a,c) E R. 
Then f, as a relation, is recursive. Hence f is a recursive partial function. 
Clearly its domain is Q. 


Thus our indexing of the recursive partial functions induces an indexing 
of the recursively enumerable relations. Define 


W, = dom feh. 
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Then Wo, W1, Wa, ... is a list of all recursively enumerable subsets of N. 
In Theorem 38E we showed that {e : W, = N} is not recursive. Similarly, 
Theorem 38F asserts that {e : e e W,} is not recursive. Define a relation 
Q by 

Q = {<a, b> :a€ Wp}. 


Then Q is recursively enumerable, since <a, b) € Q <> Jk <b, a, kò € Ty. 
Furthermore, @ is universal for recursively enumerable sets, in the sense 
that for any recursively enumerable A S N there is some b such that 
A = {a : <a, b) € Q}. The unsolvability of the halting problem can be 
stated: Q is not recursive. 


Reduction of decision problems 


Suppose we have a two-place recursive partial function f. Then we claim 
that, for example, the function g defined -by 


g(a) = f(3, a) 


is also a recursive partial function. On the basis of intuitive computability 
this is clear; one computes g by plugging in 3 for the first variable and then 
following the instructions for f. A proof can be found by formalizing 


this argument. There is some formula y = (v4, Vz, va) which weakly rep- 


resents f (as a relation) in Cn Ay. Then g is weakly represented by 9(S°0, 
vı, V2), provided that v, and v, are substitutable in o for v, and v,. (If not, 
- we can always use an alphabetic variant of p). 

Now all this is not very deep. But by standing back and looking at what 
was said, a more subtle fact is perceived. We were able to transform effec- 
tively the instructions for f into instructions for g. So there should be a re- 
cursive function which, given an index for fand the number 3, will produce 
an index for g. 


Parameterization Theorem For each m>1 and n> 1 there is a re- 
cursive function @ such that for any e, đ, b, l 


b,) = fele, a, ..-, An) (Or, «++» ba) 


(Equality here means of course that if one side is defined, then so also is 
the other side, and the values coincide.) 


lelima» EURT ams by AE i 


On the left side of the equation g consists of arguments for the func- 
tion [e]m+n; on the right side g consists of parameters upon which the func- 
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tion lole, Z)]}, depends. In the example we had m = n = 1 and a, = 3. 
Since ọ depends on m and n, the notation “o?”’ would be logically prefera- 
ble. But, in fact, we will use just “o.” 

Proof, for m=n=1 It is possible to give a proof along the lines in- 
dicated by the discussion that preceded the theorem. But to avoid having 
to cope with alphabetic variants, we will adopt a slightly different strategy. 

We know from the normal form theorem that the three-place partial 
function / defined by ; 
h(e, a, b) = fela, b) 


is a recursive partial function. Hence there is a formula y which weakly 
represents h (as a relation). We may suppose that in p the variables », and 
v», are not quantified. We can then take 


ole, a) = fy(S0, S70, vi, vo) 
= Sb(Sb(Sb(SbEHy, ito, , #50), itv, > S70), Hos , Hv), Hvas Pv). 


Then o(e, a) is the Gödel number of a formula weakly representing the 
function g(b) = [e],(a, b). Hence it is an index of g. a 


We will utilize the parameterization theorem to show that certain sets 
are not recursive. We already know that K = {a : [a]j(a) is defined} is 
not recursive. For a given nonrecursive set A we can sometimes find a 
(total) recursive function g such that 


acKkK<sg(aEead 


or a (total) recursive function g’ such that 
ag Ko g@ead. 


‘In either case it then follows at once that A cannot be recursive lest K be. 

(In the former case K is said to be many-one reducible to A; in the latter 
case the complement K of K is many-one reducible to A.) The function 
g or g’ can often be obtained from the parameterization theorem. 


EXAMPLE {a : W, = Ø} is not recursive, 


Proof Call this set A. First, note that A € J, since W, = -Bif Vb Vk 
<a, b, ky ¢ Ti. Consequently, K cannot be many—one reducible to A, 
but it is reasonable to hope that K might be. That is, we want a total re- 
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cursive function g such that 
[a],(@) is undefined = dom [g(a)]; = Ø. 


This will hold if for all b, [g(a)],(b) = [a](a). So start with the recursive 
partial function 


F(a, b) = fala) 
and let g(a) = o(f, a), where fis an index for f. Then 
IDLO) = Leh DLO) = f(a, 6) = fahla). 
Thus this g shows that K is many—one reducible to A. fa 


Theorem 381 (Rice, 1953) Let Z be a set.of one-place recursive partial 
functions. Then the set {e : [e], € Z} of indices of members of Z is recur- 
sive iff either @ is empty or Z contains all one-place recursive partial func- 
tions. 


Proof Let Ig = {e : le], E€ Z} be the set of indices of members of Z. 


Case I: The empty function @ is not in Z. If nothing at all is in @ we 
are done, but suppose some function y is in Z. We can show that Kis many- 
one reducible to Tg if we have a recursive total function g such that 


ifae K, 


Y 
@h=-{), inte 


t 


For then a € K + g(a) € Ig. l 
We can obtain g from the parameterization theorem by defining 


g(a) = ole, a), 
` where ' 
if aek, 


lehta, 8) = {0 ee 


undefined 


The above is a recursive partial function, since 
lella, b) = c <>a e K&y(b)=c 


and the righi-hand side is recursively enumerable. 7 

Case II: @ € g. Then apply case I to the complement @ of Z. We can 
then conclude that 7g is not recursive. But Ig is the complement of Ig, so 
Ig cannot be recursive. | l 


4 
; 
q 


ETI 
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EXAMPLES For any fixed e, the set {a : W, = W,} is not recursive, as a 
consequence of Rice’s theorem. In particular, {a : Wa = Ø} is not recur- 
sive, a result proved in an earlier example, For two other applications of 
Rice’s theorem, we can say that {a : W, is infinite} and {a : W, is recursive} 
are not recursive. 


Register machines 


There are many equivalent definitions of the class of recursive functions, 
Several of these’ definitions employ idealized computing devices, These 
computing devices are like digital computers but are free of any limitation 
on memory space. The first definition of this type was published by Alan 
Turing in 1936; similar work was done by Emil Post at roughly the same 
time. We will givé here a variation on this theme, due to Shepherdson and 
Sturgis (1963). ; 

A register machine will have a finite number of registers, numbered 1, 
2,...,K. Each register is capable of storing a natural number of any 
magnitude. The operation of the machine will be determined by a program. 
A program is a finite sequence of instructions, drawn from the following list: 

Ir (where 1 <r < K). “Increment r.” The effect of this instruction is 
to increase the contents of register r by 1. The machine then proceeds to 
ihe next instruction in the program. i 

Dr (where | <r < K). “Decrement r.” The effect of this instruction 
depends on the contents of register r. If that number, is nonzero, it is de- 
creased by | and the machine then proceeds, not to the next instruction, 
but the following one. But if the number in register r is zero, the machine 
just proceeds to the next instruction. In summary: The: machine tries to 
decrement register r and skips an instruction if it is successful, 

Tq (where g is an integer— positive, negative, or zero). “Transfer q.” 
All registers are left unchanged. The machine takes as its next instruction 
the qth instruction following this one in the program (if q > 0), or the 
| ¢ |th instruction preceding this one (if q < 0). The machine halts if there 
is no such instruction in the program. An instruction of T 0 results in a loop, 
with the machine executing this one instruction over and over again. 


EXAMPLES 1. Program to clear register 7, 


*~D 7 Try to decrement 7. 
[7 2- i 
T -2| Go back and repeat. 
Halt. 
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2. Program to move a number from register r to register s. 


Clear register s (Use the program of the first example.) 


+D r Take 1 from r. 
T 3 Halt when zero. 
I s Add | to s. 


T -—3 Repeat. 


This program has seven instructions altogher. It leaves a zero in reg- 
ister r. 


3. Program to add register 1 to registers 2 and 3. 


4, (Addition) Say that a and b are in registers 1 and 2. We want a + b 
in register 3, and we want to leave a-and b still in registers 1 and 2 at 
the end. 


Register contents 


Clear register 3. a b 0 

Move number from register 1 to register 4. 0 b 0 a 
Add register 4 to registers 1 and 3. a b a 0 
Move number from register 2 to register 4. a 0 a b 
Add register 4 to registers 2 and 3. a b a+b 0 


This program has twenty-seven instructions as it is written, but three of 
them are unnecessary. (In the fourth line we begin by clearing register 4, 
which is already clear.) At the end we have the number a back in register 
1. But during the program register 1 must be cleared; this is the only way 
of determining the number a. 


5, (Subtraction) Let a + b = max(a — b, 0). We leave this program to 
the reader (Exercise 10). l 


Now suppose f is an n-place partial function on N. Possibly there will 
be a program P such that if we start a register machine (having all the 
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registers to which P refers) with a, .--+4n in registers 1,...,7 and 
apply program P, then 


(1) If fG,.-+> a») is defined, then the calculation eventually terminates 
with f(a, > ap) is register n + 1. Furthermore, the calculation terminates 
by seeking a (p + 1)st instruction, where p is the length of P. 

(2) If f(ais +++ d,) is undefined, then the calculation never terminates. 


If there is such a program P, we say that P calculates f. 


Theorem 385 Let fbea partial function. Then there is a program which 
calculates f iff f is a recursive partial function. 


Thus by using register machines we arrive at exactly the class of recursive 
partial functions, a class we originally defined in terms of representability 
in consistent finitely axiomatizable theories. The fact that such different 
approaches produce the same class of partial functions is evidence that 
this is a significant class. 


Outline of proof To show that the functions calculable by register ma- 
chines are recursive partial functions, one “arithmetizes calculations” in 
the same spirit as we arithmetized ‘deductions in Section 3.4. That is, one 
assigns Gédel numbers to programs and to sequences of memory configura- 
tions. One then verifies that the relevant concepts, translated into numerical 


_ relations by the Gödel numbering, are all recursive. (After going through 


this, one perceives that, from a sufficiently general point of view, deductions 
and calculations are really about the same sort of thing.) 

Conversely, to show that the recursive partial functions are calculable 
by register machines, one can work through Sections 3.3 and 3.4 again, 
but where functions were previously shown to be representable in Ca Ag, 
they must now be shown to be calculable by register machines. This is not 
as hard as it might sound, since after the first few pages, the proofs are all 
the same as the ones used before. There is a reason for this similarity. It 
can be shown that the class of all recursive functions is generated from a 
certain handful of recursive functions by the operation of composition (in 
the sense of Theorem 33L) an the “Jeast-zero” operator (Theorem 33M). 
Much of the work in Sections 3.3 and 3.4 amounts to a verification of this 
fact. Thus once one has shown that each function in this initial handful 
is calculable by register machine and that the class of functions calculable 
by register machines is closed under composition and the least-zero operator, 
then the earlier work can be carried over, yielding the calculability of all 
recursive functions. 
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EXERCISES 


1. Define functions f and g by 


J 0 if Fermat’s last theorem is true, 
Uu otherwise, 
is a run of at least n consecutive 7’s, 


0 if in the decimal expansion of x there 
g(n) = | 
1 otherwise. 


Is f recursive? Is g recursive? 


2. (a) Show that the range of any recursive partial function is recursively 
enumerable. 

(b) Show that the range of a nondecreasing total recursive function is 
recursive, 


3. (a) Let A be a nonempty recursively enumerable subset of N. Show 
that A is the range of some total recursive function. 

(b) Show that any infinite recursively enumerable subset of N includes 
an infinite recursive subset. — , 


4, Show that every recursive partial function has infinitely many indices. 


5, Give an example of a function f and a number e such that for all a, 
S@ = Uluk <e, a, k ET) 
’ but e is not the Gödel number of a formula weakly representing fin Cn Ag. 


6. Show that the parameterization theorem can be strengthened by 
requiring ọ to be one-to-one. 


7. Recall that the union of two recursively enumerable sets is recursively 
enumerable (Exercise 7 of Section 3.5). Show that there is a total recursive 
function g such that Wye) = Wa U Wr. 


8. Show that {a: W, has two or more members} is in X, but not in J. 


9, Show that there is no recursively enumerable set A such that {fal +4 
€ A} equals the class of total recursive functions on N. 


10. Give register machine programs which calculate the following func- 
tions: 

(a) Subtraction, a ~ b = max(a — b, 0). 

(b) Multiplication, a + b. 

(c) max(a, b). 
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11. Assume that there is a register machine program which calculates 
the n-place partial function f. Show that given any positive integers ry, ..., 
r, (all distinct), p, and k, we can find a program Q such that whenever we 
start a register machine (having all the registers to which Q refers) with 
4, .. +) Gin registers r,, ..., Fa and apply program Q, then (i) if f(a,,..., 
d,) is defined, then the calculation eventually terminates with f(a,, ..., Gp) 
in register p, with the contents of registers 1,2,..., k (except for register 
p) the same as their initial contents, and furthermore the calculation ter- 
minates by seeking a (q + 1)st instruction, where q is the length of Q; 
(ii) if f(a, -> an) is undefined, then the calculation never terminates. 


12. Let g : N*+1-+ N be a (total) function which is calculated by some 
register machine program. Let f(a), ...5 An) = pb[g(ay, «+++ an, b) = 0), 
where the right-hand side is undefined if no such b exists. Show that the 
partial function f can be calculated by some register machine program. 


CHAPTER FOUR 


Second-Order Logic 


§41 SECOND-ORDER LANGUAGES 


-We can obtain richer, : 
tain richer, more expressive languages than the first-order 


languages considered 
i thus far, by allowi uantificati i 
E A a ing quantification of predicate or 


Ax(Px —> Wx Px) 


is a valid formula havin i 
g V and P as its ni 
matter how P is interpreted, parameters. Because it is true no 


WP 3x(Px -> Vx Px) 


as a predicate variable.) r 


Suppose, then, that we h i i 
DoS ; ave in addition to the symbols i 
the beginning of Section 2.1, the further logical alee Penne 


4, Predicate variabl 
f es: For each positive i 
predicate variables positive integer n we have the n-place 


n 
XP, XB. . 
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5. Function variables: For each positive integer n, we have the n-place 
function variables 
FY, FE, 


The usual variables v1, 2, --- will now be called individual variables, 
to avoid confusion. The set of terms is as before defined as the set of expres- 


- sions which can be built up from the constant symbols and the individual 


variables by applying the function symbols (both the function parameters 
and the function variables). Atomic formulas are again expressions Ph ++: tas 
where fi, -<-> Én are terms and P is an n-place predicate symbols (param- 
eter. or variable). The definition of wif is augmented by new formula- 
building operations: If p is a wif, then so also are VX? 9 and WF? p. The 
notion of a variable occurring free in g is defined just as before. A sentence 
is a wi g in which no variable (individual, predicate, or function) occurs 
free. 

It should be remarked that the roles played by predicate parameters 
and free predicate variables are essentially ihe same. There is the same 
close relationship between constant symbols and free individual variables. 

By a structure we continue to mean a function on the set of parameters 
meeting the conditions set forth in Section 2.2. We must extend the defi- 
nition of satisfaction in the natural way. Let V now be the set of all vari- 
ables, individual, predicate, or function. Let s be a function on V which 
assigns to each variable the suitable type of object. Thus s() is a member 
of the universe, s(X”) is an n-ary relation on the universe, and s(F*) is an 
n-ary operation. For a term t, (£) is defined in the natural way. In partic- 
ular, if F is a function variable, then 5UFt, “++ tn) is the result of applying 
the function s(F) to S(t), «+ + > Ela) Satisfaction of atomic formulas is 
also defined essentially as before. For a predicate variable X, 


eg Xt eee tall if GA), «+» Stn)? E s(X). 


The only new features in the definition of satisfaction arise from our new 
quantifiers. i 


5, Hy VX? Ls] iff for every n-ary relation R on [MN], Fup [s(X?|R)}. 
6. Ey VE? o [s] iff for every function f: |A je> ||, Fa? meguea)o 


Again it is easy to see that only the values of s at variables oceurring free 
in the formula are significant. For a sentence o, we may unambiguously 


speak of its being true or false in %. Logical (semantical) implication is 
defined exactly as before. 
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EXAMPL . ; 
Eo n r well-ordering is an ordering relation such that an 
conditio y as a least (with respect to the ordering) element. This 1 : 

n can be translated into the second-order sentence = 


WX(Cy Xy — Iy(Xy 0 Vz(Xz => y < z))). 


Her i 
reas Wr apple we omit the subscripts on X and F when they are i 
7 , and we omit the superscripts if they are clear from the Sie. 


EXAMPLE i 
Pa ae ne : Peano’s postulates (the induction postulate) states 
D A R aral numbers which contains 0 and is closed under the 
a on is, in fact, the set of all natural numbers. This can b 
ated into the second-order language for number theory as i 


VX(XO A Wy(Xy > XSy) — Wy Xy). 


Any mod 
aoe : ie 5 S PoR the above Peano induction postulate is iso- 
] l , 0, S); see Exercise 1. Thus this i ic 
i.e., all its models are isomorphic. Rae ees 


EXAMPLE 3 For an 
y formula g in whi redi ; 
does not occur free, the n See gr ee 


IX” Wo, «++ Wo, [Xo +, Og] 


is valid, ' vari 
anes Mey fees variables may occur free in p in addition to v; Vy») 
peice he m rA are called relation comprehension formulas There ate 
ous function comprehension form i ' 
; ; ulas. If y i 
which the variable F” does not occur free, then EEEE 
> 


Voz se Wo, Ilva Y => SF! Wo ee Vo, (FP, + Vn = Pap oy) 
ni Yat 


is valid. (Here “Aly ” ig a 
i an ab : 
Exercise 21 of Sec es [os abbreviation for a formula obtained from 


EXAMPL 
PIE4 In the ordered field of real numbers, any bounded nonempty 


und, ; te t i e seco = er sen 


WX[dyp Vz(Xz => z < y) A Az Xz = 
dy Vy (Vz(Xz >z < y) ey Sy’). 
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ExampLe 5 For each n => 2, we have a first-order sentence A, which 


translates, “There are at least n things.” For example, Ag is 
ax 3y a(x qe yA x AB ZAY AP z) 
5 for its class of models the EC, class consisting 


order sentence which is 
no last 


The set (Ag, As, ---} ha 
of the infinite structures. There is a single second- 
equivalent. A set is infinite iff there is an ordering on it having 
element. Or more simply, a set is infinite iff there is a transitive irreflexive 
relation R on the set whose domain is the entire set, This condition can be 


translated into a second-order sentence Neo: 


Ax[Vu Vv Vw kur > Xpw => Xuw) a Vu Xu A Wu dv Xul. 


Another sentence (using a function variable) which defines the class of 
infinite structures is 
JFV x Yy Ex ~ Fy -> x = y) a az Wx Fx e zl 
The preceding example shows that the compactness theorem fails for 
second-order logic. l i 


Theorem 41A There is an unsatisfiable set of second-order sentences 


every finite subset of which is satisfiable. 


Proof The set is, in the notation of the above example, 


fhos Aas Ags e} fa 


The Léwenheim-Skolem theorem also fails for second-order logic. By 
the languange of equality we mean the language (with =) having no param- 
eters other than Y. A structure for this language can be viewed as being 
just a nonempty set. In particular, a structure is determined to within iso- 
morphism by its cardinality. A sentence in this language is therefore deter- 
mined to within logical equivalence by the set of cardinalities of its models 


(called -its spectrum). 


n the second-order language of equal- 


Theorem 418 There isa sentence j 
28o, 


ity which is true in a set iff its cardinality is 


Consider first the conjunction of 


Proof, using concepts from algebra 
d field, further conjoined with the 


- the (first-order) axioms for an ordere 
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second-order sentence expressing the least-upper-bound property (see 
Example 4 of this section). This is a sentence whose models are exactly 
the isomorphs of the real ordered field (i.e., the structures isomorphic 
to the field of real numbers), We now convert the parameters 6, 1, +, +, < 
to variables (individual, function, or predicate as appropriate) which we 
existentially quantify. The resulting sentence has the desired properties. 


Theorem 41C The set of Gödel numbers of valid second-order sentences 
is not definable in N by any second-order formula. 


Here we assume that Gödel numbers have been assigned to second- 
order expressions in a manner like that used before. Although our proof 
applies to the second-order language of number theory, the theorem is 


true for any recursively numbered language having at least a two-place 
predicate symbol. 


Proof Let T? be the second-order theory of M, i.e., the set of second- 
order sentences true in M. The same argument used to prove Tarski’s 
theorem shows that #7? is not definable in N by any second-order for- 
mula. 

Now let @ be the conjunction of the members of Ap with the second-order 
Peano induction postulate (Example 2). Any model of @ is isomorphic to 
N; cf. Exercise 1. Consequently, for any sentence o, 


oe? iff (æ —> ø) is valid. 


Consequently, the set of (Gödel numbers of) validities cannot be defin- 
able lest #7? be. a 


A fortiori, the set of Gödel numbers of second-order validities is not 
arithmetical and not recursively enumerable. That is, the enumerability 
theorem fails for second-order logic. (In the other direction, one can show 
that this set is not definable in number theory of order three, or even of 
order w. But these are topics we will not enter into here.) 

It is interesting to compare the effect of a second-order. universal sentence, 
such as the Peano induction postulate 


VX(X0A Vy(Xy —> XSy) > Vy Xy) 
and the corresponding first-order schema, i.e., the set of all sentences 


(8) A Vyp) + 9(Sy)) > Vy p), 
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where g is a first-order formula having just v; free. If x is a o A z 
Peano induction postulate, then any subset of A| containing 0 = c he 
under SY is in fact all of |W |. On the other hand, if Wis a oe o 
corresponding axiom schema, we can only say that every definable T 
of |Ñ | containing 0% and closed under s" is all of |N |. There may ep 
be undefinable subsets for which this fails. (For example, take pleas 
X of Th (N, 0, S) having Z-chains. Then Y satisfies the above oe . 
schema, but it does not satisfy the second-order induction postulate. 
set of standard points is simply not definable in A.) 


EXERCISES 


4. Show that any structure for the language with parameters V, 0, 
and S that satisfies the sentences 


Sl. Vx Sx Æ 0, . 
52. Vx VWy(Sx ~ Sy => x ~ y), 


and the Peano induction postulate 


VX(XO a Vy(Xy > XSy) > Vy Xy) 


is isomorphic to Ny = (N, 0, S$). 


2, Give a sentence in the second-order language of equality which is 
true in a set iff its cardinality is $. Similarly for N. 


3. Let p be a formula in which only the n-place predicate variable i 
occurs free. Say that an n-ary relation R on |W | is implicitly denned : 
by p iff A satisfies p with an assignment of R to X but yee y 

i i - relation to X. Show that , the se 
with an assignment of any other re v that #1 
of Gödel numbers of first-order sentences true in N, is epu 
in N by a formula without quantified predicate or function variables. 


4. Consider a language (with equality) having the one-place AE 
symbols Z and S and the two-place predicate symbol E. Find a n rat | 
sentence o such that (i) if A is a set for which A NWA = Ø an Ee 
= A U PA, = A, SX = PA, EX = {<a, b> aE b = A}, then 
model of o; and (ii) any model of ø is isomorphic to one of the sort de- 
scribed in (i). 
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§4.2 SKOLEM FUNCTIONS 


A want to show how, given any first-order formula, one can find a logi- 
cally equivalent prenex second-order formula of a very special form: 


universal individual 


existential! quantifiers 
quantifiers 


quantifier-free formula 


Ss . a prenex formula wherein all universal quantifiers are individual ones 
whic follow a string of existential individual and function quantifiers 
In the simplest example, observe that l 


Wx Ay p(x, y) F = AF Yx g(x, Fx). 


In a =’ direction this is easy to see. For the “H” direction, consider 
i structure Y and an assignment function s satisfying Wx Ay p(x, y). We 
now that for any a € |% | there is at least one b € |A | such that 


Fa p, y») [s(x]a)(y]6)I. 


We obtain a function f'on |N | by choosing one such b for each a and tak- 
ing f (a) = b. (The axiom of choice is used here.) Then 


Fa Vx p(x, Fx) BELAN. 


This function f is called a Skok j 
stiuëture M. olem function for the formula Wx Ay in the 


The same argument appli 
1 pplies more generally. As a second 
pose that we begin with the formula lad 


Ay, Vx, Iy Vx Yxa Iys PO, Yar Ya). 
(We have listed only y,, Y2, and yz, but presumably other variables occur 


free in y as well.) Here we alr i 
.) He already have the existenti antifier 
the left. What remains is Serres wars 


Vx Ay, Wx_ Vx5 As Yai, Vos Va). 


This is a special case of the first 
example (with g(x, =W 
WY1» Y2, Ya). It is logically equivalent, as before, . Ye) x Vx; Jy; 


AF, Wx; WX Vx, ays pı » Fx, Ya). 


uss 
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Now we have the existential quantifiers dy, JF, at the left; what remains 


is 
Yx Wx_ V3 Avs pr FoX, Ys). 
By the same reasoning as before, this is logically equivalent to 
AF, Vx, Vx VX Pi, Fox, F3X1X2X3), 


where F, is a three-place function variable. Thus the original formula is 


equivalent to 
dy, IF, AF, Vx Yx Yxa Pr, Fox» FX XoXo). 


For quantifier-free y, this is in the form we desire. 


Skolem Normal Form Theorem For any first-order formula we can find 


a logically equivalent second-order formula consisting of 

(a) first a string (possibly empty) of existential individual and function 
quantifiers, followed by 

(b) a string (possibly empty 
by 

(c) a quantifier-free formula. 


) of universal individual variables, followed 


_ A formal proof could be given using induction, but the preceding example 


illustrates the general method. 
Recall that a universal (V,) formula is a first-order prenex formula all 
œ, where œ is quantifier- 


of whose quantifiers are universal: Vx, Vx °° Wx, 
order prenex formula 


free. Similarly, an existential (4,) formula is a first- 
all of whose quantifiers are existential. 


Corollary 42A For any first-order p, we can find a universal formula 
0 in an expanded language containing function symbols, such that gris 
satisfiable iff 0 is satisfiable. 

By applying this corollary tọ ~p, we obtain an existential formula (with 
function symbols) which. is valid iff ọ is valid. 

Proof Again we will only illustrate the situation by an example. Say 
that pis g i 
l dy, Yx Aye Yxa Vx Iya POr, Ye» Yə) 


First we replace y by the logically equivalent formula in Skolem form: 


dy, 4, APs Vx, VX, VXs POr FX FoX XX). 
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Then for 0 we take 
Vx, Vx Vx yle, Ix » ZX1X2X3), 


where c, f, and g are new function symbols having zero, one, and three 
places, respectively. a 


Corollary 42B In a recursively numbered language having for each 
k > 0 infinitely many k-place function symbols and a two-place predicate 
symbol, the set of Gödel numbers of valid existential (first-order) sentences 
is not recursive. 


Proof Given any sentence ø we can, by applying Corollary 42A to 
“io, effectively find an existential sentence which is valid iff ø is valid. 
Hence a decision procedure for the existential validities would yield a de- 


cision procedure for arbitrary validities, in contradiction to Church’s 
theorem. E 


It is possible to strengthen this corollary by weakening the hypotheses, 
It is not too hard to see that the two-place predicate symbol is unnecessary, 
and that it suffices that there be k-place function symbols for arbitrarily 
large k. (The proof utilizes faithful interpretations.) 

We can use predicate variables instead of function variables in these 
results, but at a price. Suppose we begin with a first-order formula. It 
is equivalent to a formula y in Skolem normal form; suppose for simplicity 
that y = JF p, where p has only individual quantifiers and F is a one- 
place function variable. We can choose g in such a way that F occurs 
only in equations of the form u ~ Ft (for terms ¢ and u not containing 
F). This can be done by replacing, for example, an atomic formula e(Ft) 
by either Vx(x ~ Ft-—>a(x)) or 3x(x ~ Ft a a(x)). 

Next observe that a formula 


Iru ~ Ft__., 
wherein F occurs only in the form shown, is equivalent to 
AX(Vy 3!z Xyz a ___Xtu__). 


If one pursues this question (as we will not do here) one finds that any 
first-order formula is logically equivalent to a second-order formula con- 
sisting of l 


(a) a string of existential predicate quantifiers, followed by 
(b) a string of universal individual quantifiers, followed by 
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(c) a string of existential individual quantifiers, followed by 
(d) a quantifier-free formula. 


There are corresponding versions (which the reader is invited to formulate) 
of Corollaries 42A and 42B. 


EXERCISES 


L Prove the Léwenheim-Skolem theorem in the following improved 
form: Let Ul be a structure for a language of cardinality x. Let S be a subset 
of | % | having cardinality A. Then there is a substructure B of Ù of cardi- 
nality 4 + x such that for any function s mapping the variables into E 
and any (first-order) 9, 


Hapis] iff Fao Is]. 


Suggestion: Choose Skolem functions for all formulas. Close S under the 
functions. 


2. State the two corollaries referred to in the last sentence of Section 
4.2. (The second of these can be compared with Exercise 5(b) of Section 
9.5. where it is shown that the set of Y, validities without function symbols 
is decidable.) 

§4.3 MANY-SORTED LOGIC 


We now return to first-order languages, but with many sorts of variables, 
ranging over different universes. (In the next section this will be applied 
to the case in which one sort of variable is for elements of a universe, another 
for subsets of that universe, yet another for binary relations, and so forth.) 

In informal mathematics one sometimes says things like, “We us Greek 
letters for ordinals, capital script letters for sets of integers, . Seed In ef- 
fect, one thereby adopts several sorts of variables, each sort having its own 
universe. We now undertake to examine this situation precisely. As might 
be expected, nothing is drastically different from the usual one-sorted sit- 
uation. None of the results of this section are at all deep, and most of the 
Po ae a nonempty set J, whose members are called sorts, 


and symbols arranged as follows: 


A. Logical symbols 


0. Parentheses: (, ). 
1. Sentential connective symbols: =, —. 
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2. Variables: For each sort i, there are variables vj, vi, ... of sort i 
3. Equality symbols: For some i € J there may be the symbol =; 
said to be a predicate symbol of sort <i, id. ? 

B. Parameters 


0. Q antifier symbols: For each sort 7 th is a univ 

uan y : ere 18 1 i 

b i i ersal quantifier 

1. Predicate symbols: For each n > 0 and each n-tuple iy i, > 
preety 


of sorts, there is a set (possibly empty) of n-place predicate symbols 
each of which is said to be of sort <i,,..., i,>. 


2. Constant symbols: For each sort i there is a set (possibly empty) 
of constant symbols each of which is said to be of sort i. 

3. Function symbols: For each n > 0 and each (n + J)-tuple <i. 
in» Iny} Of sorts, there is a set (possibly empty) of m-place fonction 
symbols, each of which is said to be of sort <h, 


As usual, we must assume that these categories of symbols are disjoint 
and further that no symbol is a finite sequence of other symbols. 


Each term will be assigned a unique sort. We define the set of terms of 
sort i inductively, simultaneously for all i: 


{. Any variable of sort i or constant symbol of sort i is a term of sort i 


r 2. Tf. a i+, t, are terms of sort i,, ..., Íp, respectively, and fis a func- 
ji symbol of sort <i,,..., ins inyi then ff; -+- ta is a term of sort 
nti’ 


as definition can be recast into a more familiar form. The set of pairs 
<t, i> such that ¢ is a term of sort 7 is generated from the basic set 


{vid in>1&ie Tu {<ce,i> +e is a constant symbol of sort i} 


by the operations which, for a.furiction symbol f of sort <i, bys baa 

: : aed Pao Prise 

sles the pair <ft, +++ fns inp from the pairs <4, i> ty, i 

i ; | Mie tae 3 nZ” 

R chee foina is a sequence Pr, ++» f consisting of a predicate sym- 

ae sort Gh, <.: b and terms t,,..., 4, of sort h, ..., i,, respectively. 

e nonatomic formulas are then formed using the connectives >, —> 

and the quantifiers V,vi,. <= 

A many-sorted structure Ñ is a function on the set of parameters which 
assigns to each the correct type of object: 


1. To the quantifier symbol V,, Ù assi ean 
bo gns a nonempt j ; 
the universe of Ù of sort i. ' EN RAEN EANA 


-eeo lns iny 
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2. To each predicate symbol P of sort <ij,.-+ stn» A assigns the relation 
Ps |W, xe XA fin: 


3. To each constant symbol c of sort /, WU assigns a point c% in |% |,. 
4. To each function symbol f of sort Chis meee las ea? YW assigns a 
function 


SENA a X ote x |W A ina 


tn 
. The definitions of truth and satisfaction are the obvious ones, given that 
Y; is to mean “for all members of the universe |Ù |; of sort i.” 

In a many-sorted structure, the universes of the various sorts might or 
might not be disjoint. But since we have no equality symbols between sorts, 
any nondisjointness must be regarded as accidental. In particular, there 
will always be an elementarily equivalent structure whose universes are 


disjoint. 


Reduction to one-sorted logic 


Many-sorted languages may at times be convenient (as in the following 
section). But there is nothing essential that can be done with them that cannot 
already be done without them. We now proceed to make this assertion 
in a more precise form. 

We will consider a one-sorted language having all the predicate, con- 
stant, and function symbols of our assumed many-sorted language. In 
addition, it will have a ‘one-place predicate symbol Q; for each jin J. There 
is a syntactical translation taking each many-sorted sentence o into a one- 
sorted sentence o*. In this translation all equality symbols are replaced by 
=~. The only other change is in the quantifiers (the quantifier symbols and 
the quantified variables): We replace 


V ht hmm 


by 
WolQw > ——v-—) 


where v is a variable chosen not to conflict with the others. Thus the quan- 
tifiers of sort i are relativized to Q4. 

Turning now to semantics, we can convert a many-sorted structure Y 
into a structure U* for the above one-sorted language. The universe a] 
is the union Uer | U |; of all the universes of Ù. To Q; is assigned the set 
|A |; On the predicate and constant symbols "* agrees with YW. For a 
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Eroen i J, the function f W is an arbitrary extension of f™. (Of 
a t p last. sentence does not completely specify f%. The results we 
give for W* hold for any structure obtained in the manner just described ) 


Lemma 43A A many-sorted sentence ø is true in Y iff o* is true in Y* 


Pete is a makes a stronger statement concerning formulas. 
ent is then proved by induction. - 

Consider now the other direction. A one-sorted structure is not al 
convertible into a many-sorted structure. So we will impose REPA 
ditions, Let ® be the set consisting of the following oe pace 


1. Sv Ow, for each i in Z. 


2. Woy ++ Wv,(Q; vee 
don symbol fol Ci ae a a each func- 
pee aa rt (iy, ». +5 bns inyi. We include the case n = 0, in which 
e the above becomes the sentence Q;c for a constant symbol c of sort i 


Notice that the above Ñ* was a model of Ø. A one-sorted model B of ® 


a an rt B . he con i i i 


| Bt |; = OP; 
ps? — P% n (Q8 x B 
PX ++ X OF) where P is a predicate sym 

o ymbol of sort 

cB! — c9; 

J% = f® A (OB X «+ x OB x OB) i 
‘ aX în X QP.) the restriction of f to QF 
; i x Q8, where fis a function symbol of sort <i, i 
Me > eset Side 2 | 


L : 
Ee oe If Bis a model of ®, then BË? is a many-sorted structure 
ermore, a many-sorted sentence o is true in B? iff o* is true in y. 


The proof is similar to the proof of Lemma 43A 
Notice that B#* is not in general equal to 8. (For example, | 8| may 


contain points not belongi B 
to ging to any QX.) On the other hand, %**# is equal 


Theorem 43C In the many-sorted language 


aEG 


iff in the one-sorted language 


c*U@ bot, 
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Proof (=>) Let 8 be d one-sorted model of Z* U ®. Then B? is a model 
of Z by Lemma 43B. Hence Bt is a model of o. So by Lemma 43B again, 


B is a model of o*. 
(<=) Similar, with Lemma 43A. B 


By using Theorem 43C, we can now infer the following three theorems 
from the corresponding one-sorted results. 


Compactness Theorem If every finite subset of a set x of many-sorted 
sentences has a model, then 2 has a model. 


Proof Assume that every finite subset Dy of & has a many-sorted model 
WY. Then a finite subset XX of X* has the model Xx. Hence by the ordinary 
compactness theorem, y* has a model 8. Bt is then a model of 2. a 


Enumerability Theorem For a recursively numbered many-sorted lan- 
guage, the set of Gédel numbers of valid sentences is recursively enumerable. 


Proof For a many-sorted o, we have by Theorem 43C, 
Eo iff OF o*, 


Since ® is recursive, Cn @ is recursively enumerable. And o* depends re- 
cursively on g, So we can apply Exercise 1(b) of Section 3.5. a 


Léwenheim-Skolem Theorem For any many-sorted structure (for a 
countable language) there is an elementarily equivalent countable structure. 


Proof Say that the given structure is X. Then U* is a one-sorted model 
of (Th* u È. Hence by the ordinary Léwenheim-Skolem theorem, 
(ThID)* U @ has a countable model 8. B is a model of Th and so is 
elementarily equivalent to X. i] 


§4.4 GENERAL STRUCTURES 


We now return to the discussion of second-order logic begun in Section 
4.1. We discussed there (a) the grammar, i.e., the set of wfis for second order, 
and (b) the semantics, i.e., the notion of structure (which was the same as 
for first order) and the definition of satisfaction and truth. 

In this section we want to leave (a) unchanged, but we want to present 
an alternative to (b). The idea can be stated very briefly: We view the lan- 
guage (previously thought of as second-order) now as being a many-sorted 
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elementary (i.e., first-order) language. The result is to make open to inter- 
pretation not only the universe over which individual variables range but 
also the universes for the predicate and function variables. This approach 
is particularly suited to number theory; that case is examined briefly at 
‘the end of this section. . 


The many-sorted language 


Despite the fact that we want ultimately to consider the grammar ‘of 
Section 4.1, it will be expedient to consider also a many-sorted language 
constructed from the second-order language of Section 4.1. We take N, 
sorts: the one individual sort (with variables »,, vz, ...); for each n > 0, 
the n-place predicate sort (with variables X? , X}, ...); and for each n > 0 
the n-place function sort (with variables F?, F}, ...). We will use equality 
(=) only between terms of the individual sort. The predicate and function 
parameters of our assumed second-order language will also be parameters 
of the many-sorted language, and will take as arguments terms of the in- 
dividual sort. 

In addition, we now use two new classes of parameters. For each n > 0 
there is a membership predicate parameter ¢, which takes as arguments one 


term of the n-place predicate sort and n terms of the individual sort. Thus, 


for example, 
EgX 010g 


is a wif. Its intended interpretation is that the triple denoted by (v3, V, va) 
is to belong to the relation denoted by X°. This is exactly the interpretation 
assigned previously to the second-order formula 


XVP Wg, 


and, in fact, the reader is advised to identify these two formulas closely 
in his mind, 

For each n > 0, there is also the evaluation function parameter Ep. 
E, takes as arguments one term of the n-place function sort and n terms of 
the individual sort. The resulting term, 


EF" ty on ths 


is itself of the individual sort. Again the reader is advised to identify closely 
the term E,F"t, +--+ ta with the previous Ft, +++ ta. 

There is an obvious way of translating between the second-order language 
of Section 4.1 and the present many-sorted language. In one direction we 
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stick on the e, and E, symbols; in the other direction we take them off. 
The purpose of these symbols is to make the language conform to Section 
4,3. 

_A many-sorted structure has universes for each sort and assigns suitable 
objects to the various parameters (as described in the preceding section). 
First, we want to show that without loss of generality, we may suppose that 
e, is interpreted as genuine membership and E, as genuine evaluation. 


Theorem 44A Let Ù be a structure for the above many-sorted language 
such that the different universes of Ù are disjoint. Then there is a homo- 
morphism h of Ù onto a structure B such that 

(a) h is one-to-one, in fact the identity, on the individual universe (from 
which it follows that 

Eapts) iff Fao [hos] 
for each formula ¢). 

(b) The n-place predicate universe of 8 consists of certain n-ary relations 
over the individual universe, and <R, a), ..., > is in ER iff <da, ooog An 


ER. . . 
(c) The n-place function universe of B consists of certain n-place functions 
on the individual universe, and E®(f, dis -<>> an) =S (as 0 an). 


Proof Since the universes of Ù are disjoint, we can define h on one uni- 
verse at a time. On the individual universe U, A is the identity. On the uni- 
verse of the n-place predicate sort, 


hO) = {a,, -.., Gy: Each a; is in U and <Q, 0i, > an> is in e) 
Thus 
(1) CETE 


Similarly, on the universe of the n-place function sort, 


iff <Q, a,,...,@,> is in ef. 


s an? € R(Q) 


h(g) is the n-place function on U whose 
value at (a,,..., An? is EX(g, ay, «++» An). 
Thus l 
(2) hga, prey an) = EXE, Ay, vers Ay). 
For %8 we take simply the membership relation, 


(3) CR, ay, -.-,4, is in cP iff Kays. ay ER. 
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For EË we take the evaluation function, 


(4) ERS Gags =F Gy 5 24 Ap). 


Then it is clear upon reflection: that / is a homomorphism of Y onto Y 


That h preserves e, follo 
i ws from (1) and (3), where in (3) we 
hal ; take R = 
a from (2) and (4) it follows that h preserves E i ore 
; ne 
Pio ee we have to verify the parenthetical remark of part (a). This 
ws from the many-sorted analog of the homomorphism eoim of 


why S g ac y l 


B 3 

; 7 = a eine we can restrict attention to structures 8 in which 
a n are fixed by (b) and (c) of the theor i 

; orem. But since e8 and EP ar 
etermined by the rest of B, we really do not need them at all When an 


3 a g r 


General structures for second-order languages 


These structures i i 
These provide the alternative semanti j 
beginning of this section. R 


Definiti 
efinition A general pre-structure Ù for our second-order language 


consists of a structure (in the origi : 
sets: ( original sense), together with the additional 


(a) for each n > 0 

, an n-place relation univers ich i 
rse i 
eton on TET. se, which is a set of n-ary 


(b) for each n > ’ n -p N 
0 ann lace function universe, which sa set of functions 


Y is a general structure if, i iti 
in addition i 
ie a 3 , all comprehension sentences are 


The I ie ; 
ae teen of the definition requires explanation. Firsi, a compre- 
sentence is a sentence obtained as zati h 
hension formula (s s a generalization of a compre- 
ee Example 3 i wes 
form p of Section 4.1). Thus it is a sentence of the 


Vya e Vym AX” Woy + V0, (Xo, eoe 0, p), 
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where X” does not occur free in p, or 


Yy, RER VYalY 4 e Vin Any g => 
I Vo, ee Von eee Vn Vay F p), 


where F” does not occur free in p. 
Next we must say what it means for a comprehension sentence (or any 


second-order sentence for that matter) to be true in A. Assume then that N 
is a general pre-structure. Then a sentence o is true in U iff the result of ` 
converting o into a many-sorted sentence (by adding £, and Ep) is true in 
N, with e, interpreted as membership and E, as evaluation. 

More generally, let p be a second-order formula, and let s be a func- 
tion which assigns to each individual variable a member of |% |, to each 
predicate variable a member of the relation universe of Y, and to each func- 
tion variable a member of the function universe of A. Then we say that Di 
satisfies p with s, = p [s], iff the many-sorted version of p is satisfied with 
sin the structure Ù, where €p is interpreted as membership and E, as evalua- 
tion. 

The essential consequences of this definition of satisfaction are the fol- 
lowing, which should be compared with 5 and 6 of page 269. 


LG WX" [s] iff for every R in the n-place relation 
universe of X, Es o [s(X"|R)]- 
Ee vrro fs] iff for every f in the n-place function 


universe of YX, Hip E” AN. 


This, then, is the alternative approach mentioned at the beginning of 
treating the second-order grammar as being a many- 


the section. It involves 
sorted first-order grammar in disguise. Because this approach is basically 


first-order, we have the Löwenheim-Skolem theorem, the compactness | 
theorem, and the enumerability theorem. 


Theorem If the set 2 of sentences in a countable 


Léwenheim-Skolem 
a general model, then it has a countable general 


second-order language has 
model. 


Here a countable general model is one in which every universe is count- 
able (or equivalently, the union of all the universes is countable). 


Proof Let T' be the set of comprehension sentences. Then X UT, viewed 
has a countable many-sorted model by ~ 


as a set of many-sorted sentences, 
preceding section. By Theorem 44A, 


the Lowenheim-Skolem theorem of the 
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‘a homomorphic image of that i 
model is a general pre-st ri isfyi 
& U T, and hence is a general model of 2. ENS 


T Theorem If every finite subset of a set & of second 
order sentences has a general model, then & has a general model i 


Proof The proof is exactly as above. Every finite subset of X UI" has 


a many-sorted model, so we can appl 
> the 
preceding section. PRI aa a theorem of the 


E K4 Py 
Meccan Theorem Assume that the language is recursively num 
. Then the set of Gödel numbers of seco’ 
ered econd-order sentences which 
true in every general structure is recursively enumerable “ 


Proof A i i 
mek if sentence o is true in every general structure iff it is a many- 
ed consequence of J’. And #4’ is recursive. a 


T », 
Eeo a T assure us that there is an acceptable deductive 
sh that r is deducible from Z iff r is true i 
A le fi rue in every general model of 
e beginning of Section 2.4). B 
vaca en tare i .4). But now that we know 
s plete deductive calculus, there i i 
the detailed development of one. í aii 
we: 
Rees 7: a > two approaches to second-order semantics as fol- 
: ersion of Section 4.1 (which we will call f 
Ce ( call absolute second-order 
s ure, in which the meaning of the par i 
open to interpretation by structure i a 
s, but the notion of bei for 
a subset is not left open, but i a 
| , but is treated as having a fixed i 
version of the present section ( Se ae 
general second-order logic) id 
ing to a fixed’notion of subset, and i eee a 
i : consequently is reducible to first-or 
logic. In this respect it is li i i pene 
i pect it is like axiomatic set theor h 
sets and sets of sets and so fo ae 
rth, but the theory i f 
: l 5 y is a first-order theory. 
i ina ae a es of structures, general second-order logic annie 
which logical implication holds. That is, i 
; a ; t is, if ever ] 
of X is a general model of o, i er e 
é o, it then follows that X E o in 
2 absolute second- 
aS logic. But the converse fails. For example, take Y = @: The ese 
S m . . : ee : 
ee ences true in all general models is a recursively enumerable subset of 
nonarithmetical set of valid sentences of absolute second-order logic 


Models of analysis 


We can illustrate the ideas of this section by focusing attention on the 
most interesting special case: general models of second-order aube 
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theory. Consider then the second-order language for number theory, with 
the parameters 0, S, <, -, and E. We take as our set of axioms the set A} 
obtained from Ay by adding as a twelfth member the Peano induction pos- 
tulate (Example 2, Section 4.1). From Exercise 1 of Section 4.1, we can 
conclude that any model (in the semantics of that section) of A% is iso- 
morphic to %. l 

.But what of the general models of our axiom set? They can differ from 
N in either (or both) of two ways. We can employ the compactness theorem 
as before to construct (nonstandard) general models of the axioms having 
infinite numbers (i.e., models 9{ with an element a which is larger than the 
denotation of S”0 in the ordering <¥), We can also find (nonabsolute) 
general models in which, for example, the set universe (the unary relation 
universe) is less than the full power set of the individual universe. Indeed, 
any countable general model must be of this kind. 

It is traditional for logicians to refer to second-order number theory as 
analysis. The name derives from the fact that it is possible to identify real 
numbers with sets of natural numbers. In second-order number theory we | 
have quantifiers over sets of natural numbers, which we can view as quan- . 
tifiers over real numbers. The appropriateness of the naine is nevertheless 
open to question, but its usage is well established. By a model of analysis 
we will mean a general model of the above axiom set Af. . 

Define an w-model of analysis to be a model of analysis in which the in- 
dividual universe is N and the denotations of 0 and S are the genuine 0 
and S. (Consequently, the denotations of <, +, +, and E are also standard.) 
The motivation for studying w-models can be stated as follows: We have a 
clear understanding of the set N. But we do not have anything like the same 
understanding of its power set @N. For example, we may be uncertain 
whether its cardinality is $$, or &, or more. So it is reasonable to hold fixed 
that which we are sure of (N), but to leave open to interpretation by a struc- 
ture that which we are not sure of (PN). 

Among the w-models of analysis there is the one absolute model, whose 
n-place relation universe consists of all n-ary relations on N (and whose 
function universes consist of all possible functions). A first-order sentence 
is true in an arbitrary w-model of analysis iff it is true in N. But the w-model 

may disagree with the absolute model on second-order sentences. 

In the next theorem we assert that an œw-model of analysis is completely 
determined by its set universe (i.e., its one-place relation universe). 


Theorem 44B IFA and B are w-models of analysis having the same one- 
place relation universe, then Y= 8. 
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Proof Suppose R belongs to the three-place relation universe of A. 
Let {R} be the “compression” of R into a unary relation: 


CR} = {a, b, c) : <a, b, c) € R}. 


Our sequence-encoding function is recursive and hence is first-order de- 
finable in number theory by a formula y. RY is in the set universe of Ñ 
by virtue of the comprehension sentence 


VX? 3X VulX'u <> Jv, Jv, Ivglplo, v2, V3, U) A Xv Pv). 


Thus (RY is in the set universe of B; we unpack it by a similar argument. 
R is in the three-place relation universe of 8 by virtue of the comprehension 
sentence , 


VX? AX? Wo, Wo, Voal X vDo <> Au(y(v,, Vos Vg, uv) A Xn). 
A similar argument applies to the function universes. a 


Consequently, we can identify an w-model of analysis with its set universe 
(which is included in9’N). Not every subclass of WN is then an w-model of 
analysis, but only those for which the comprehension sentences are satisfied. 


EXAMPLES of w-models We need only specify the set universe. 


1. PN is the absolute model. 

2, Let (A, €4) be a model of the usual axioms for set theory such that 
(i) the relation €4 is the genuine membership relation {<a, b> i:a € A, 
b € A, and a € b} on the universe A, and (ii) A is transitive, i.e., if a € b € A, 
then a € A, Then the collection of all those subsets of N that belong to A 
is an w-model of analysis. 

3. For a class. x < PN, define DX to be the class of all sets BC N 
which are definable in the w pre-structure with set universe.” by a formula 
of the language of second-order number theory, augmented by parameters 
for each set in Z. Then define by transfinite recursion on the ordinals: 


My = D, 
Arn = DL, 
WM, = |) %, for limit 2. 
a<r 


By cardinality considerations we see that this ‘stops growing at some ordinal 
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‘ees si from 
be the least such £; it can be shown ( 
OER table ordinal. Vp, 


and is called 


r which Sga = 
ie I A Skolem theorem) that Bo is a eet ae 
coincides with Ua A, (the union being over all or on 2 sane 
the class of ramified analytical sets. It is an @-model o rae : i ooh 
of the comprehension sentences follows from the fact tha Bp E By 
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Alphabetic variants, 119-120 
Analysis, nonstandard, 164-173 
Arithmetic, see Number theory 
Arithmetical hierarchy, 235-238 
Arithmetical relations, 235 
Atomic formulas, 73 
Automorphisms, 93 
Axiomatizable theories, 146-147 


B 


Biconditional, 17 

Boolean functions, 45-47 
Bound variables, 79 

Bounded quantifiers, 202, 238 


C 


Calculi, see Deductive calculi 
Cantor, Georg, 11 

Cantor’s theorem, 150 

Cardinal arithmetic theorem, 12 
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Categorical sets, 143, 270 
Categoricity in power, 147 
Chinese remainder theorem, 248 
Church’s theorem, 231, 242 
Church’s thesis, 199-201, 226, 233 
Circuits, 53-58 
Compactness theorern, 101, 136, 139 
in many-sorted logic, 281 
in second-order logic, 271, 286 
in sentential logic, 34, 59-60 
Complete sets of connectives, 49 
Complete theories, 145, 147, 226 
Completeness theorem, 124, 128 
Comprehension formulas, 270 
Computable functions, 200-201, 255 
Computing machinery, 16, 53-58, 252-253, 
see also Register machines 
Concatenation function, 215 
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Congruence relations, 134 
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Construction sequences, 23 
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Countable sets, 9 


D 


De Morgan’s laws, 37 
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thesis 
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Deduction theorem, 111 
Deductions, 103 
Deductive calculi, 101--108 
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Descriptions, see Defined function sym- 
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Disjunctive normal form, 48-49 
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Duality, 38 
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Effective enumerability, 61-62, see also 
Recursively enumerable relations 
Effective procedures, 60-61, see also 
Church’s thesis 
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Elementary equivalence, 92 
Eliminable definitions, 163 
Elimination of quantifiers, 181 
Enumerability theorem, 101, 137-139 
in many-sorted logic, 281 
in second-order logic, 272, 286 
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Finitely axiomatizable theories, 146 
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Fixed-point lemma, 227 
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Formula-building operations, 20 

Formulas, see Well-formed formulas 

Free variables, 74-75 
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Frege, Gottlob, 141 

Function symbols, 68 

Function variables, 269 
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General structures, 284 
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Generalization theorem, 109 
Generated sets, 25 
Gödel, Kurt, 128, 139, 141, see also Gödels 
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Gödel B-function, 248 
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Halting problem, 259 
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Homomorphism theorem, 91 
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Identity interpretation, 158 
iff, 4 
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incompleteness theorem, see Gödels in- 
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Independent axiomatizations, 38 l 
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Jsomorphisms, 90 
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Kleene normal form, 254, 257 


L 


Languages, see First-order languages, For- 
mal languages, Second-order languages 

Least-zero operator, 208 

Lindenbaum’s theorem, 239 
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"Logical equivalence, 83 
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Monotone recursion, 216 
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Negation, 14, 17 
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Nonstandard analysis, 164-173 
Nonstandard models, 175, 287 
Normal form theorem, 254, 257 
Number theory, 175 
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Parameterization theorem, 260, 266 
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Partial functions, 255 
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Peano induction postulate, 183, 270, 272, 
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Presburger’s theorem, 188 
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Proofs, see Deductions 
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